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Chapter 1 



INTRODUCTION 



In this chapter the problem of understanding the human processing 
of language is first discussed. As a small effort toward developing such 
an understanding through psycholinguistic experimentation, the present 
study is next outlined. Finally three related psycholinguistic studies 

which helped the writer in the design and analysis of the present inves- 
tigation are summarized. 

1.1 Defining the Problem 

One primary use of natural language is to communicate ideas. To do 
so the ideas must be expressed in a well-structured way, according to a 
set of generally accepted rules, namely the grammar of a language. The 
set of rules is learned by children in a language community where people 
speak to each other. But the precise specification of these rules is very 
difficult. The grammar, once in the mind, allows a speaker to encode his 
ideas so as to be understood by others in the same community. Likewise, 
the grammar may be used to detect, and sometimes also correct, any deviant 
usage. The deviant usages may be of many different types, among which are 
the use of sentences with more than one interpretation. 

Everyday speech abounds with ambiguous sentences. Even written mate- 
rial, when parsed by a computer program (usually based on a context free 
grammar) , has been shown to contain many ambiguous constructions [Kuno and 
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Oettinger, 1963; Robinson, 1962]. To native speakers, however, these 
ambiguous constructions usually convey only one meaning. It is not known 
whether a native speaker without any linguistic sophistication would ever 
have noticed the ambiguity of such sentences. Even less known is whether 
a native speaker would analyze a sentence syntactically and, if ambiguous, 
would then apply i. ,s "semantic knowledge" to resolve the ambiguity. What 
is known is the fact that native speakers ^ often resolve syntactically 
ambiguous sentences, even when they occur as isolated sentences. However, 
for the theoretical analysis of sentences, linguists often make the sim- 
plifying assumption that a syntactic analysis precedes a semantic analy- 
sis [Chomsky, 1957, 1965; Katz and Fodor, 1963; Lamb, 1965a, 1965b]. For 
purposes of language processing by a computer , it is almost a necessity 
to make such an ordering assumption, since a computer executes its in- 
structions serially (at least with the present generation of computers) 
[Simmons, 1965; Garvin, 1963]. 

The writer believes that the technique of psycholinguistic experimen- 
tation, as well as formal analysis, can be applied to discover some vari- 
ables which allow the human to resolve ambiguous sentences. Of course the 
content of any manageable experiment must necessarily be narrow. (Unfor- 
tunately when the content is narrowea, the study is often criticized as 
trivial.) In this research study, the problem of ambiguity is confined to 
situations in which a key pronoun in a sentence has more than one possible 
antecedent. The study is part of an overall plan in the Language Develop- 
ment Program of the Center for Human Growth and Development to investigate 
the development of language behavior from children to adults. The objec- 
tive in this study is to discover some parameters that control the ability 
to resolve ambiguous pronominal reference in different age groups. It 
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is hoped that once we have discovered the parameters that allow the reso- 
lution of an ambiguous pronominal reference to a certain degree, it will 
be possible to investigate the more difficult question of how the contex- 
tual information is used to resolve the ambiguity. 

The experiment involves a set of 176 sentences, each of which con— 
P^o^ouns with ambiguous antecedents. These sentences were given to 
experimental subjects (henceforth to be abbreviated by Sy or Ss if plural) 
of four age groups (fifth, seventh, and eighth grade children and college 
sophomores) who were asked to judge the most appropriate referent in each 
sentence. The controlling variables for this experiment consist of three 
sentence types, four grammatical forms, and four verb-pair classes. The 
reader is referred to Chapter 2 (EXPERIMENTAL DESIGN) which lists the vari- 
ables, discusses the rationale for choosing these variables and the pro- 
cedure for generating the 176 sentences from the three variables. 

The resulting data will be analyzed by three mathematical models. Each 
model explains certain aspects of the observed behavior. The Bernoulli 
trials model partitions the entire set of sentences into two mutually ex- 
clusive sets, the unstarred set which is interpreted as containing ambig- 
uous sentences and the starred set which is interpreted as containing un- 
ambiguous sentences. This model suggests a possible definition of ambig- 
uity. The next model, the k-limited transducer model, characterizes the 
human as a sentence processor with limited memory in which he first decodes 
the sentence into its deep structure and then makes an assignment for the 
pronominal referent. This model explains the degenerative behavior, i.e., 
tendency toward random responding, when sentences become more complex svn— 
tactically or when £s are younger, other variables remaining constant. 
Finally, the information transmission model takes the response data directly 
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to compute various components of transmitted information (defined in 
Chapter 4.3). These components measure the ability to predict the pro- 
nominal referent by knowing certain combinations of variables. This 
model purports to explain why some combinations of variables cause a 

uniform response to one interpretation while other combinations cause a 
random response. 

The proposed models are satisfactory only to the extent that they 
have functional equivalents, i.e., that the models account for the ob- 
served behavior. Although we can never hope to know what is happening 
within the central nervous system in the process of resolving pronominal 
ambiguity, the models will allow us to predict other unobserved behaviors 
which may be tested in future studies. The ultimate objective for such 
an investigation is to discover some of the heuristic procedures the hu- 
man uses in processing and understanding sentences in natural language. 

If the computer is ever to become "sophisticated" enough in language pro- 
cessing to abstract scientific papers, retrieve key information and pos- 
sibly even translate across languages, such human abilities must be encoded 
in the machine. 

1*2 Three R elated Psvcholinguistic Studies 

The first experiment to be reviewed is that of Kaplan [1949], His 
study was an attempt to demonstrate that contextual information could be 
used to reduce the number of interpretations of a key word. Miller and 
Isard s [1964] study investigated the human capacity to recall sentences 
with self-embedded clauses and considered the relation of self-embedding 
to the theoretical model of push-down automata. The study by Fraser, 

Bellugi and Brown [1963] investigated the imitation, comprehension and 
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production of some ten grammatical features in three-year-old children. 

To this last study, McNeill [1965] postulated three separate memory spans 
to account for both the results of Fraser, et al, and of Ervin [1964] who 
thought that her results were contradictory to those of Fraser, et al. 

These studies served as guideposts from the design to the analysis of the 
present study. 

a. Kaplan [1949]. A group of seven "translators" (some professional 
and some high school graduates) were presented with single words along 
with a set of ten meanings to be judged as appropriate, i.e., the sense 
in which they may be conventionally used. Then each translator was pre- 
sented with the word in one of the following seven kinds of contexts: one 

word preceding (PI) , one word following (FI) , both one word preceding and 
one word following (Bl) , two words preceding (P2) , two words following (F2) , 
both two words preceding and two words following (B2) and the entire sen- 
tence. The task of each translator was to check those senses for which 
the test words were appropriate in each given context. 

Kaplan selected 140 sentences randomly from some ten pure and ap- 
plied mathematics books. The 140 key words for which the ambiguity waS 
to be determined were the so-called "content words", i.e., nouns, verbs 
and adjectives. They were selected from the 140 sentences which varied 
in length from 15 to 40 words. For each key word, ten senses were 
selected from the fifth edition of Webster’s Collegiate Dictionary . If 
less than ten senses were given by the dictionary, arbitrary senses were 
contrived so that every key word was given along with ten senses. 

Each key word was scored according to the total number of ascriptions 
of acceptable senses plus the total number of denials of contrived senses. 



o 



6 



Kaplan defined the percent reduction for a given context as the ratio of 
the score with that context to the score without context. However, the 
writer feels that the term "reduction" is better used to define the ratio 
of the number of senses that was eliminated in a given context to the num- 
ber without context. Hence in the following paragraph, the term reduction 
will be used according to this new definition which differs from Kaplan* s. 
If Kaplan gave a value of n% reduction to a word, then the new definition 
would give a value of (100 - n)%. 

Kaplan found that PI was the least effective in reducing the number 
of senses (25%) , while the sentence context reduced the largest number of 
the senses (63%). However, the contexts P2, F2 and B2 were not signifi- 
cantly different from that of the sentence, which suggested that any two- 
words provided a context as good as the entire sentence. This classical 
experiment demonstrated that the reduction of senses for any key word was 
possible through its immediate neighboring words. It did not, however, 
indicate which parameters the human translators used to reduce the senses. 

In the present experiment, the writer also demonstrates that reduc- 
tion from two to one sense of the key word is often possible. Since he 
could manipulate the variables in the experiment one at a time, it was pos- 
sible to discover those combinations of variables which made the reduction 
possible. These variables will be proposed as among the parameters we use 
to resolve ambiguity. 

Miller and Isard T19641 . Twenty-four adult Ss were tested for 
free recall of six sentences, each 22 words long but differing in the 
degree of self-embedding in their phrase structures. These sentences were 
recorded on tape and played over earphones to one S at a time. As soon as 
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a sentence was heard, the ^ attempted to repeat it verbatim; this was re- 
corded for analysis later. Each sentence was presented and repeated five 
times in this way. This continued until the ^ had memorized all six sen- 
tences. Six different orders of presentation of the sentences were pre- 
pared, in counterbalanced design, and four ^s learned the sentences in each 
order. The responses of the ^s were scored in terms of the number of words 
recalled in the same order as the words In the original sentence. 

The different degrees of self-embedding are Illustrated by the fol- 
lowing sentences: 

(0) She liked the man that visited the jeweler that made the 
ring that won the prize that was given at the fair. 

(1) The man that she liked visited the jeweler that made the 
ring that won the prize that was given at the fair. 

(2) The jeweler that the man that she liked visited made the 
ring that won the prize that was given at the fair. 

(3) The ring that the jeweler that the man that she liked 
visited made won the prize that was given at the fair. 

(4) The prize that the ring that the jeweler that the man that 
she liked visited made won was given at the fair. 

(random) Won given liked that that the fair man made visited 
prize the at the the she that jeweler was the ring that. 

The results of the 24 S.s are shown in Figure 1-1. The performance 
curves for zero and one degree of self-embedding are quite close, as are 
the curves for three and four embeddings, with the performance on the doubly 
embedded sentences falling somewhere in between. Miller and Xsard explained 
the results through analogy with computer processing of subroutines. That 
is, whenever a subroutine is called during the main routine, the main rou- 
tine is temporarily terminated in order to execute the subroutine. But the 
point of return to the main routine must be stored so that the main routine 
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can continue after the termination of the subroutine. But if within the 
subroutine, the same subroutine is called, a special kind of address stor- 
age is needed to store the return address. If this capacity for recursive 
calling of the same subroutine is provided in a computer program to a 




Figure 1-1: Relation of percent correctly recalled vs. successive trials. 

depth of n, then the computer will be able to process (equivalent to 
correct recall in the human learning experiment) self-embedding sentences 
to a depth of n + 1. The results strongly suggest that this recursive 
calling capability is limited in human Ss; in fact n is one (i.e., two 
embeddings) if we require a 90% correctly recalled criterion after five 
trials . 

Miller and Isard*s experiment conclusively demonstrated the distinc- 
tion between grammatically acceptable sentences and psychologically accept- 
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able sentences. Hence for practical considerations, such as writing a 
computer sentence parser, it is not necessary to consider sentences of 
indefinite degree of embedding; some small number is satisfactory, such 
as, say, ten. 

In the present study, the k-limited transducer model will be defined 
and interpretations supplied to explain the degenerative behavior (i.e., 
tendency toward random responding) as sentences become more complex syn- 
tactically or for younger ^s. Some studies [Slobln, 1966; McNeill, 1965; 
Brown and Fraser, 1964] have shown that children as young as four years 
old have acquired the competence of both the passive and negative trans- 
formations. Hence from linguistic competence considerations, there should 
not be any degeneration due to either or both the passive and negative 
transformations in sentences, either within one age group or across the 
four age groups. However, it is apparent that the performance of the ^s, 
especially the younger children, were far inferior to their expected com- 
petence. Memory limitations are assumed to be the major factors for the 
Inferior performance, as explained in more detail in the following section. 

c. Fraser. Bellugl and Brown [1963] . A group of three-year-old ^s 
were tested on three tasks: Imitation, comprehension and performance. 

Fairs of sentences were designed so that they differed only on one gram- 
matical feature. Associated with each pair of sentences, there was a 
pair of contrasting pictures which distinguished the particular feature. 
For example, one pair of sentences, "The sheep is jumping" vs. "The sheep 
are jumping" was accompanied by a pair of contrasting pictures, one of 
which depicted a single sheep jumping over a fence while a second sheep 
looked on and the other depicted two sheep jumping over the fence. This 
pair of sentences differed only in whether the auxiliary ^ was singular 
or plural and the two pictures differed only in whether there was one or 
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two sheep performing the jumping. The sentence pairs with their associated 
contrasting pictures were used to control the nerformance of a child’s 
grammar. Altogether ten different grammatical features were tested, for 
example, singular-plural marked by inflection ("the boy draws" vs. "the 
boys draw"), present progressive-past tense ("the paint is spilling" vs. 

"the paint spilled"), subject-object in the active ("the boy hit the girl" 
vs. "the girl hit the boy"), subject-object in the passive ("the car is 
bumped by the train" vs. "the train is bumped by the car"), etc. 

In the comprehension task, the experimenter (E) read a pair of sen- 
tences to the child while pointing to the respective pictures. Then E 
repeated one of the two sentences and asked the child to point to the ap- 
propriate picture. He was scored right or wrong according to the picture 
to which he pointed. In the production task, the E again read the pair of 
sentences while pointing to each picture respectively. Then the E pointed 
to one of the pictures and asked the child to say the appropriate sentence. 
He was scored right or wrong only according to whether his response included 
the particular grammatical feature. In the imitation task, no pictures 
were used. The child was asked to repeat the J|’s model sentence and again 
was scored right or wrong according to whether his response included the 
particular grammatical feature. 

The results showed that comprehension exceeded production, often by 
a large margin, in every grammatical contrast, while imitation exceeded 
comprehension on every grammatical contrast except one, again often by a 
large margin. Fraser, et al explained the result of comprehension exceeding 
production by the fact that production places a greater load on a child’s 
memory. The imitation task depended only on perceptual-motor skills which 
do not operate through the meaning system, and hence it was the easiest of 
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the three tasks. McNeill [1965] preferred to explain the results by pos- 
tulating three separate memory spans, the longest one for phonological 
production (equivalent to the task of imitation) , the next shorter one 
for grammatical comprehension, and the shortest one for grammatical pro- 
duction. Whenever a sentence is shorter than the particular memory span, 
the corresponding performance is achieved correctly, but when a sentence 
exceeds a particular memory span, the corresponding performance cannot 
be achieved correctly. In the experiment by Fraser et al., the sentences 
were all short (with an average length of four morphemes and a maximum of 
eight morphemes) $ hence it was plausible to assume that the shorter sen- 
tences were within the grammatical comprehension span while the longer 
ones were within the phonological span. The only advantage of this expla- 
nation by McNeill is that it is possible to account also for the findings 
of Ervin [1964] who found that imitation and production were not different 
in her two-year-old ^s. The reason given is that Ervin's children were 
imitating adult speech, which was generally quite long, and that two-year- 
old children could easily be assumed to have shorter memory spans. Hence 
for these younger children, if all the sentences exceeded even the phono- 
logical span, then neither imitation nor production could be achieved ac- 
curately. Therefore, Ervin's findings were compatible with the findings 
of Fraser et al. through a model with three different memory spans. 

Of course the specification of the size and unit of the respective 
postulated memory spans is a separate and very difficult question. Since 
it is possible for a human to encode his message to overcome his memory 
limitation, it is almost impossible to know the size of the span without 
knowing the encoding scheme. Yet the encoding scheme may differ depending 
on the particular task and message. If McNeill's postulated memory spans 
are tenable, then there are probably other spans for other tasks. 



o 



In this experiment, two memory spans are postulated to account for 
the observed degenerative behavior as sentences become more complex syn- 
tactically or as ^s become younger. The model used is a concatenation 
of two k-limited transducers, one for decoding the input sentence into 
its deep structure and the other for taking the deep structure as input 
to assign the pronominal referent. The k for the k-limited transducer 
reflects the maximum number of input symbols that the transducer can pro- 
cess, and upon interpretation, the k reflects the memory span of the 
When a sentence falls within a s decoding span and in addition the de- 
coded message is within the £*s assignment span, then the referent will 
be assigned correctly. However, as a sentence becomes more complex syn- 
tactically, the chance of exceeding a ^*s decoding span becomes greater. 
Likewise as a ^ becomes younger, it is reasonable to assume that both the 
decoding and assignment spans become shorter, and hence the chance for a 
sentence to have an incorrectly assigned referent becomes greater. 



Chapter 2 



EXPERIMENTAL DESIGN 



This chapter describes the construction of the test sentences. The 
controlling variables and the rationale for these variables will be pre*” 
sented first; then the procedure for generating the test sentences will 

be explained. 

2.1 Controlling Variables 

experiment consisted of a set of 176 sentences y all of which 

could have two interpretations. The sentences were constructed from the 

following three variables: three sentence types, four grammatical forms, 

and four sets of verb-pairs. 

The three sentence types are; 

John Yed Bill and/but he Xed him. 

John Yed Bill and /but he Xed him back. 

John and Bill Yed each other and /but he Xed him. 

The four grammatical forms are: 

Active (A), e.g., he Xed him. 

Negative (A), e.g., he did not X him. 

Passive (P) , e.g., he was Xed by him. 

Negative-passive (P) , e.g., he was not Xed by him. 

The four classes of verb-pairs are: 

Nonsense verbs (NS), i.e., Y and X are consonant— vowel— consonant 
syllables . 

Identical verbs (ID), i.e., Y and X are the same verb, from the 
set: {tease, hurt, strike, punch}. 
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Similar verbs (SI), i.e., Y and X are both outward action 
verbs, from the set of ordered pairs: {<hit, kick>, 

<push, shove>, <help, assist>, <frighten, scare>}. 

Logical reversal verbs (LR) , i.e., Y is an inward action 
verb, and X is an outward action verb, from the set 
of ordered pairs: {<understand, answer>, <hear, call>, 

<remember, phone>, <recognize, invite>}. 

For example, the following three sentences were constructed by using 

the nonsense verb-pairs: 

Bill qeged John and he yeced him. [A-A] 

John was zeved by Bill and he pehed him back. [P-A] 

Bill and John did not kez each other but he was qemed 
by him. [A-P] 

The first sentence was constructed by using the first sentence type, with 
the first clause in the active grammatical form (A) and the second clause 
also in the active grammatical form (A) . The second sentence was con- 
structed by using the second sentence type, with the first clause in the 
passive grammatical form (P) and the second clause in the active grammat- 
ical form (A) . The third sentence was constructed by using the third sen- 
tence type, with the first clause in the negative grammatical form (A) and 
the second clause in the passive grammatical form (P) . 

Further, the following three sentences were constructed by using the 
real verb-pairs for each of the three sentence frames: 

John did not hurt Bill but he hurt him. [A-A] 

Bill did not help John but he was assisted back by him. [A-P] 

John and Bill were not understood by each other and he did not 
answer him. [P-A] 

The first sentence used one member from the class of identical verb-pairs 
in its A-A construction; the second sentence used one member from the 
class of similar verb-pairs in its A-P construction; and the third sentence 
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used one member from the class of logical reversal verb-pairs in its 
P-A construction. 

2.2 Terminology 

The term sentence type will be applied to the three sentence struc- 
tures where X and Y can be any verbs, and the two clauses can be in 
any one of four grammatical forms. The three sentence types will be 
abbreviated respectively as the He Xed him type. He Xed him back type and 
Each other type. A sentence frame will be any one of the sentence types 
with a particular set of verb-pairs. Hence there are four sentence frames 
for each sentence type. When a specific sentence frame is referred to, 
the particular sentence type will be preceded by an adjective which de- 
scribes one of the four verb-pair classes. For example, the nonsense He 
Xed him frame is that frame in which the verbs X and Y are nonsense 
syllables in the He Xed him sentence type. Since every sentence is con- 
structed out of two clauses, each can be in one of four grammatical forms, 
a specific sentence of a sentence type or sentence frame will be referred 
to by its grammatical forms. For example, the A-P sentence of the He 
Xed him type is John Yed Bill and he was Xed by him , and the A-P sentence 
of the identical He Xed him back frame is John did not strike Bill but he 
was struck back by him . 

2.3 Rationale for the Variables 

Of the three sentence types, the Each other type was Intended to 
serve as a basis of comparison. It was expected that any sentence of 
this type would be so ambiguous that it would produce random responding 
and thus provide a base line against which to compare responses to the 
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other two sentence types. The He Xed him back type was used because it 
contains probably the most frequently exposed compound sentences for 
children. The referent for the pronoun he should be unambiguous because 
the word "back" provides a semantic context which leads the reader to 
switch the subjects of the two verbs in the two respective clauses. Thus 
if John Yed Bill , then it must be Bill who Xed him back . In the He Xed 
him type, the most plausible interpretation is also to switch the subjects 
of the two verbs in the two respective clauses. Here the subtle contex- 
tual information comes from the structure of the two clauses; that is, if 
John Yed Bill and John Xed him are implied, then there exists another con- 
struction which would better express the two relations, as for example, 
the construction John Yed and Xed Bill . However, if the verbs X and Y 
were the same, then the construction John Xed and Xed Bill (e.g., in the 
context, John kicked and kicked and kicked . . . Bill) has certain poetic 
flavor for emphasizing the action, but is rarely used in ordinary writing 
or conversation. Hence in this situation, the more plausible interpre- 
tation would be John Xed Bill and Bill Xed him . 

For each of the two clauses in a sentence type, four grammatical forms 
were used : active (A) , negative (A) , passive (P) , and negative passive 
(P) . Hence each sentence frame had a set of 16 individual sentences. 
These sentences were different ways of expressing the same idea, but were 
expected to vary in the degree of ambiguity of the pronominal referent (as 
measured by the non-uniformity of response from groups of ^s) . It was not 
known initially whether the use of the conjunction and or but would alter 
the degree of ambiguity, but the following rule was consistently used: 
and was the conjunction whenever both clauses were affirmative (i.e., A 
or P), or both negative (i.e., A or P) , and but was used whenever one 
clause was affirmative and the other was negative. 
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Since every sentence was made up of two clauses, it seemed intuitive 
that altering the relationship between the two verbs could alter the inter- 
pretation of the sentence, and hence the choice of the pronominal referent. 
Since there were two persons and two verbs in each sentence, there were 
four possible ways of expressing the two actions in the sentence types He 
Xed him and He Xed him back s 

1) John Yed Bill and John Xed Bill (back) . 

2) John Yed Bill and Bill Xed John (back) . 

3) Bill Yed John and Bill Xed John (back) . 

4) Bill Yed John and John Xed Bill (back) . 

For any given sentence, the first clause was already specified, hence the 
number of interpretations was reduced to two (of course one could intro- 
duce other possibilities by equating him to someone other than John or Bill .) 

Three classes of verb-pairs were taken from a previous study [Chai, 

1966] in which the ^s were instructed to insert two real verbs before they 
decided on the interpretation of the pronoun he. Each class of verb-pairs 
made one interpretation more plausible than the other. The identical verb- 
pairs made interpretations one and three almost impossible; the logical 
reversal verb-pairs made interpretations one and three most likely; while 
the similar verb-pairs were less determinant. 

The class of logical reversal verb-pairs was constructed in such a 
way that the first verb always implied a previous action in which the 
second verb was a response. The use of "reversal" denotes a change of 
subject-object relationship. For example, John understood Bill implies 
that Bill asked (questioned, called, etc.) John ; consequently, in order 
to decide on the referent of he in the sentence John understood Bill and 
he answered him (back) . the ^ had to rely on some implied sentence, as, 
for example. Bill questioned John and he answered him (back) . Once the 
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implied sentence was constructed, interpretations two and four were al- 
most certain, i.e., he * John * Hence in the original form, John under- 
stood Bil l and he answered him (back) , -the fact that he = John corre- 
sponded to interpretations one and three. For the other verb-pairs in 
the logical reversal class, the verb hear implied most likely the verbs 
ask, yell . call , etc; the verb remember implied most likely the verbs or 
verb phrases call, leave a message , make an appointment , etc; the verb 
recognize implied most likely the verbs meet , call , surprise , etc. 

The class of identical verb— pairs was constructed from verbs that 
could be repeated in a sentence, e.g., John hurt Bill and he hurt him 
_(bac^. The class of similar verb-pairs was constructed from sets of two 
verbs which denoted very closely related actions, e.g., <hit, kick>. The 
order of the two verbs in the class of similar verb-pairs is assumed to 
have no difference in the choice of the pronominal referent, e.g., John 
hit Bill and he kicked him (back) and John kicked Bill and he hit him 

have the same referent for he. But such is not the case for the 
class of logical reversal verb-pairs, e.g., John understood Bill and he 
answered him (back) and John answered t'lll and he understood him (back) 
have very different referents for the pronoun he. All the verb-pairs in 
the above three classes, except <remember, phone> and <recognize, invite>, 

were selected from the set of verb-pairs that a group of _Ss supplied in 
the previous study. 

Nonsense syllables formed another class of verb-pairs. The purpose 
of using the nonsense words was to investigate the transmission of pronom- 
inal reference by the structure of the sentence rather than by means of 
extra-linguistic information. Presumably the sentence with nonsense verb- 
pairs would be responded to least uniformly, especially by the younger 
children. These nonsense verb-pairs were randomly selected from a set of 
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consonant-vowel-consonant syllables having a Krueger index from 50 to 59 
[Krueger, 1934]. These belonged to the set with a relatively small number 
of associations. The ^s were specifically instructed not to substitute 
real verbs for the nonsense ones, but to decide the pronominal referent 
solely on the structure of the sentence. Of course there was no guarantee 
that the ^s did not supply their own lexical meaning for the nonsense verbs 
before they decided on the pronominal referent. But the fact that sen- 
tences with nonsense verbs were responded to less uniformly than the same 
sentences with real verbs indicated that the ^s had more difficulty with 
this set of sentences. In addition, the average time for responding to the 
nonsense sentences (11.3 seconds for college sophomores) was longer than 
sentences with real verbs (10.5 seconds), but much less than the average 
time for writing a pair of real verbs before responding to sentences with 
nonsense verbs, as estimated from the previous study (24 seconds). 

2.4 Sentence Generation Procedures 

For each sentence frame, 16 sentences were constructed from all com- 
binations of active (A) , negative (A) , passive (P) and negative-passive (P) 
grammatical forms of the two individual clauses. Ideally each verb-pair 
class needed 16 members, but it was almost impossible to construct that 
many verb-pairs within the vocabulary range of the fifth graders (except 
in the case of the nonsense verb-pair class) without repetition and with- 
out altering the class Interpretation. Thus, the decision was made to 
use four words for each of the real verb-pair classes. A ’’Latin square” 
procedure as shown in Table 2-1 was used in assigning the verb— pairs to 
the individual sentences of a sentence frame. The rows of the matrix in 
Table 2-1 correspond to the grammatical form (A, A, P or P) of the first 
clause while the columns correspond to the grammatical form (A, A, P or P) 
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of the second clause. Each cell, then, corresponds to a particular sen- 
tence. For example, the A-P cell (row A and column P) corresponds 
to the sentence John did not Y Bill but he was Xed by him . The numbers 
(1 to 4) in each cell correspond to the first to fourth verb-pair of 

Table 2-1: "Latin square" for the 

assignment of verb-pairs. 




each of the three classes of real verb-pairs. Thus for the He Xed him 
sentence frames, the number 3 in the above A-P cell indicates that 
the third verb-pair from each verb class was substituted for the verbs 
Y and X in the sentence John did not Y Bill but he was Xed by him ; 

Identical: John did not strike Bill but he was struck by him. 

Similar: John did not help Bill but he was assisted by him. 

Logical Reversal: John did not remember Bill but he was phoned 

by him. 

The following were the number of sentences thus generated. He Xed 
him and He Xed him back sentence types each had four classes of verb- 
pairs and 16 different grammatical forms. Thus there were 2 x 4 x 16 
* 128 sentences. In addition, the Each other sentence type had three 



classes of verb-pairs (the identical verb-pairs were not allowed because 



It was felt that some of the resulting sentences would be anomalous, or 
even ungrammatical, as for example, John and Bill did not strike each 
other but he struck him) and 16 different grammatical forms, giving 
48 sentences. Thus there were all together 128 + 48 - 176 sentences. 

For each sentence, the choice of John or Bill as the grammatical 
subject was random. The set of 176 sentences were grouped into two 
parts, one containing 48 sentences that had nonsense syllables as the 
verb-pairs, the other containing 128 sentences that had the real verb- 
pairs. Within each part, the sentences were typed in a random order. 

The sentences are reproduced in Appendix A. 



Chapter 3 



DATA 



In this chapter the data collection procedures will be described. 
First, the considerations involved in choosing the age groups will be 
discussed, then the procedural instructions for the ^s will be presented, 
and finally the preparation of the resulting data for computer analysis 
will be explained. 

3.1 Choice of Age Groups 

The data for the investigation consisted of responses from four age 
groups. The age groups were selected with the aim of covering the period 
of transition from near inability to resolve the pronominal referent to 
near uniformity in the choice of the pronominal referent. Even though 
some psychologists have shown that children of four to six years can com- 
mand the passive and negative transformation fairly well [Slobin, 1966; 
Brown and Fraser, 1964; McNeill, 1965], much more linguistic competence 
is required to select the appropriate pronominal referent. For example, 
it was necessary to generalize from the everyday sentence Johnny hit me 
and I kicked him back to John beied Bill and he (Bill) gowed him back and 
further to John was not zeved by Bill but he (Bill) was kezed back by him. 
It was decided to use the written form, rather than the oral form, for 
the following reasons: some of the sentences were fairly long and were 

expected to exceed the immediate memory span of younger children; some of 
the nonsense syllables were difficult to pronounce; it was impossible to 
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decldfi how many tlmas a sentence should be repeated orally to Insure com” 
prehension and whether the number should be constant for all sentences 
and/or all age groups; and finally, individual oral testing would have 
been too time consuming, whereas group oral testing would have been too 
difficult to administer. 

Once the written form was decided upon, considerations of reading 
ability required that the youngest subjects be at least in the fourth 
grade. Finally fifth grade was selected as containing the lowest age 
group that would reasonably well insure adequate reading ability and in- 
terest. Interest was a necessary criterion, since the testing was to 
occur during regular class hours. Seventh and ninth grades were also se- 
lected for the tests, and in addition college sophomores were chosen to 
represent the adult language population. Unfortunately, the time for 
testing the ninth grade was during the last class of the day, which appar- 
ently led to much random responding in an attempt to finish quickly. Sub- 
sequently, a large group of eighth graders were tested and to be tested 
again three months later to see if these children would respond differently 
(hopefully more uniformly) after they were introduced to some modern con- 
cepts of structural linguistics. The results from the first testing (before 
structural linguistics) for the eighth grade children were used in place 
of the ninth graders because their average IQ*s match more closely to the 
seventh graders. However, the responses of the ninth graders are presented 
in Appendix B of this report. 

The fifth grade children came from the Adams School and the seventh, 
eighth and ninth grade children came from the West Junior High, all of which 
are public schools in Ypsilanti, Michigan. The college ^s were students at 
the University of Michigan and were paid. The children were tested for 
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approximately 40 minutes in their regular class session, without any 

■ '! 

monetary compensation. The college Ss were tested in two groups, one In { ! 

the afternoon and the other in the evening. 

3.2 Procedural Instructions for the Subjects. | ' 

Ij 

The testing materials were prepared in the form of a booklet with 
written Instructions. The experimenter (the writer) always described jj 

briefly the purpose of the experiment to the Is and explained the in- 
structlons with some specific examples. The booklet was divided into U 

two parts, one containing 48 sentences that had nonsense syllables as p 

^ iU 

the verb-pairs and the other containing 128 sentences with real verb- 

pairs. Following the normal counterbalancing procedure, half of the Sb | 

in each age group worked the set of nonsense verb-pairs before the set of 
real verb-pairs, while the other half worked in the reverse order. They J 

were instructed to record the beginning and terminating time for each | 

part. 

The Ss were instructed to mark an ”x" above the word John or Bi^. to |j 

denote the most appropriate referent of he. They were further instructed 
to answer every sentence in the given order . One or two ^s who responded { 

with an "x” on the word for the first mentioned person (i.e., the first | 

word) in every sentence on at least five of the seven pages were discarded 
from the analysis. In the children’s groups, some of the £s were not able j 

to finish during the class hour; hence the set containing the unfinished 
sentences was thrown out. The result was that the total number of ^s was 
reduced for some of the frames. In particular the eighth grade had one 
fewer male S (25 male Is instead of 26) , the seventh grade had one fewer 
female S (12 female ^s instead of 13) , and the ninth grade had one fewer 
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male ^ (13 male instead of 14) . All discarded sentences contained 
the set of nonsense verb-pairs. Hence for the nonsense He Xed hlTn frame, 
the nonsense He Xed him back frame and the nonsense Each other frame, the 
total number of male eighth grade ^s was 25, the total number of female 
seventh grade ^s was 12 and the total number of male ninth grade ^s was 13 

3.3 Data Collection 

The results were transcribed by hand to a coding sheet. A response 
to the grammatical object-slot (i.e., the mark "x" was above the word that 
was the grammatical object of the first verb) was recorded as 1 and a 
response to the grammatical subject-slot was recorded as 0. For the Each 
other sentence type the first mentioned person was taken as the subject 
and the second mentioned person as the object. This was only intended as 
a notational convenience so that the terms "subject" and "object" could be 
used for all three sentence types. These coding sheets were then punched 
and verified. A MAD program was written to accept these data cards to 
tabulate the number of l*s (object-slot) for each group of ^s (males and 
females separately) and for each of the 16 sentences in a sentence frame. 
These results were punched out as data for an information transmission 
analysis program. The results for the college sophomores, for example, 
appear as 4x4 response matrices in Tables 5-1, 5-2 and 5-3. The number 
in each cell corresponds to the number of ^s who responded with the object- 
slot. Hence the difference between the number in each cell and the total 
number of ^s is the number of responses with the subject-slot. The "total" 
matrices are pooled data from both sexes. 




Chapter 4 



MODELS OF ANALYSIS 



In this chapter three mathematical models are proposed for analyzing 
the observed data. Each model will be defined and interpretations will 
be supplied to relate the experiment with the mathematical model. The 
Bernoulli trials model partitions the entire set of sentences into two 
subsets, one containing the unambiguous sentences, the other containing 
sentences with ambiguous pronouns. The k-limited transducer model char- 
acterizes the human as a sentence processor with very limited memory. 

This model explains the degenerative behavior (tendency toward random re- 
sponding) when sentences become more complex syntactically or when ^s are 
younger, other variables remaining constant. Finally, the information 
transmission model characterizes the ^s' task of reading each sentence and 
giving a response for the pronominal referent as a communication system. 

It defines the term '^transmitted information" as a measure of the degree 
of predictability of the pronominal referent from the structure of the 
sentence. 

4.1 Bernoulli Trials Model 

Feller defined Bernoulli trials as "repeated independent trials, 
each with only two possible outcomes, and their probabilities remain the 
same throughout the trials" [1957, p. 135]. In other words, we let there 
be n trials, each with probability p in obtaining the outcome 1, and 
probability 1 - p in obtaining the outcome 0. Let the random variable 
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til 

represent the outcome of the i independent repeated trial, i = 

1,2, . . . , n. Then the random variable Y * X- + X« + . . . + X 
’ 1 2 n 

represents the outcome of Bernoulli trials of n independent events 
(or trials) . The random variable Y has a binomial distribution which 
is tabulated for different values of p and n (e.g.. Tables of the 
Cumulative Binomial Probability Distribution , Harvard University Press, 
1955) . 

We can then calculate the probability of obtaining an observed value 
of Y in an n- trial. But for a large value of n, this probability is 
always very small; hence it is better to calculate a confidence interval 
[u,v] such that for a given choice of a, the probability of Y ^ u or 
Y ^ V is < a. For this experiment, a is chosen to be .01. Hence 
the probability of obtaining a Y within the confidence interval is 
1 - 2a or .98 when we assume all the S_s in a group responded in random 
way, i.e., that each response is a Bernoulli trial. Hence if Y falls 
outside the confidence interval [u,v], then it is unwise to assume that 
^s' responses can be described by Bernoulli trials. 

The Bernoulli trials model is used in the present experiment to sep- 
arate the sentences which have ambiguous pronominal referents from those 
that do not. The probabilistic approach allows us to select a certain 
chance level a of being wrong. In the following we will describe our 
experiment in relation to the model, and will employ a statistical method 
for estimating the value of p. 

The task of each ^ in this experiment was to select the most appro- 
priate referent for the pronoun he in each sentence. Since he was in- 
structed to mark an "x" on top of either John or Bill as the referent of 
he, the S_ had only two choices as he read each sentence. There is a cer- 
tain probability p that John would be selected along with the probabil- 
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Ity 1 - P that BiU would be selected. Hence the outcome of a single S 
responding to a particular sentence is a trial with two outcomes. In- 
stead of assuming p to be .50. we will describe a method later to es- 
timate the value of p from the group responses so that it will represent 
each s probability of responding with John. Since xt is reasonable to 
assume that each S in a group responds independently of any other in the 

same group, we have a case of Bernoulli trxals. 

Since by experimental design, the choice of John or Mil as the gram- 
matical subject of each sentence was random, it would be more revealing 
to consider the outcome, not as John or Mil. ^ut as a grammatical subject- 

slot or a grammatical object-slot. Thus we let the random variable = 1 

, , . 1 nrtA V =s 0 wli6ii tlis outcoins is 3 sub”" 

when the outcome is an object-slot, and 

ject-slot. Then the random variable Y - Xj^ + Xj + • • . + X^ xs the num 
her of Is out of n who responded with the object-slot. The numbers xn 
the response matrices such as those in Tables 5-1, 5-2 and 5-3 are the val- 
ues of Y for each sentence in the college sophomore group. The total 
matrices are the cell-by-cell addition of the corresponding male and female 
responses. For each value of Y, there is an implied number n - Y whxch 
corresponds to the number of Ss who responded with the subject-slot. 

The Interpretation of the confidence interval is the following. If 
a sentence has an ambiguous pronoun he. t**®" ® forced choxce 

condition, has to select a referent arbitrarily. There may, nev 
be some bias toward the grammatical object (or subject) slot so that the 
value of p may not be .50. Whatever the value of p, the sum of the in- 
dividual random responses for a group of £s follows a binomial dxstr xbutxon . 
Hence if an observed Y falls within the confidence interval [u.v] . we 
a 085: chance that the corresponding sentence elicxtated random re 
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spondins due to its a».iSuous pronoun. On the other hand, if a sentence 

has an una»higuous pronoun, then each S would have responded wxth the 

aa»e referent. Hence the resulting value of . is either very large. 

» , (when the referent is the object-slot) or very small. .< u (when 

the referent is the subject-slot) . Since the value of Y is outside the 

confidence interval [u.v]. we would not expect it to have resulted from 

random responding by the Ss. The confidence interval serves to partition 

the set of sentences into two subsets, one containing those sentences with 

an ambiguous pronoun, the other containing those sentences with an unam- 

n..n Therefore the Bernoulli trials serve as an operational 
biguous pronoun, inereroxe 

definition for the concept of pronominal ambiguity. 

The value of p is estimated in the following way. Since each age 

group, each sentence type, and each set of verb-pairs may cause a certain 
tias toward the object (or subject) slot, we are forced to calculate p 
separately for each sentence frame in an age group. Since all the 16 
sentences in a frame are different ways of expressing the same idea, they 
provide 16 sample observations by which we can estimate p. bet these 

values be denoted as Y,. i = l^* 

number of Ss (from a total of n Ss) who responded with the grat»atical 
Object-slot to the 16 sentences of the frame. Since these numbers are 
the results of Bernoulli trials with unknown p. we can estimate this P 
By the maximum likelihood estimator [Mood. 1950. Chapter 8]. 

16 
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The numbers in each response matrix such as those shown in Table 5-1 
are the number of Ss who responded with the gramatical object-slot to 
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the 16 sentences of a sentence frame, i.e., they are the The 

maximum likelihood estimator p and the confidence Interval [u,v], where 
a Is chosen as .01, are written Immediately under each matrix. Those 
Y^*s which fall outside the confidence Interval [u,v] are starred. The 
sentences corresponding to the starred cells are Interpreted as unambig- 
uous sentences. 

4.2 K- limited Transducer Model 

The following formal model Is used to account for the degenerative 
behavior (tendency toward random responding) both within a sentence frame 
when sentences become more complex and across age groups when ^s are youn- 
ger. Correct responding depends on the ^s having decoded the sentences 
Into their deep structures and on some general knowledge of the use of the 
English language In describing two actions between two persons. We can 
use a k-llmlted transducer that takes as Input the given sentences and 
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Figure 4-1: Model of human sentence processor. The kT limits the 

decoding of a sentence and the k*T limits the assign- 
ment of the pronominal referent. 
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produces as output the deep structures of the input sentences. .The out- 
put is then fed to another k— limited transducer which uses the general 
rules of English usage to assign an output of John or Bill . A diagram 
of the k-limited transducer model is shown in Figure 4-1. 

A k— limited transducer kT is a device that can read an input tape 
and write an output tape. Let Aj., Aq and 2 be finite sets of input 
symbols, output symbols and internal states respectively. Then the trans- 
ducer kT is completely specified by a set of transition rules of the 
form: 

-.X, 2^> <y, 2 > 

iJ 

where 

X is a string over the input s3mibols A^ and its 
length lg(x) < k, 

y is a string over the output symbols A^, 

2^ is an i internal state, 

2^ is a j internal state. 

The rule specifies that when the kT is in its state 2^, reading an 
input string x of at most k units, the kT writes the string y on 
the output tape while its internal state is moved to another state 2 . • 

d 

During this process, the input tape moves to the right lg(x) units so 
that the kT is ready to read the next string of at most k units. Like- 
wise, the output tape also moves to the right lg(y) units so that the 
kT is ready to write the next output string. The kT is a more general 
transducer than the finite transducer defined by Chomsky [1963, p. 346], 
which is an 1-limited transducer. 

The intended interpretation for the k-limited transducer in the pre- 
sent experiment is to consider each ^ as a sentence processor. The S de- 
codes each clause of the sentence into its deep structure and uses his 
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knowledge of English (l*e«, his linguistic competence) to assign the 
pronominal referent. The input and output symbols may be taken as the 
words. As for the decoder stage, there is a certain limitation up to 
which the ^ can process correctly. Since the task was to identify the 
referent of the pronoun the S_ must first discover the subject-object 

relation in the two clauses so that the two actions can be connected 
with John and Bill . When a clause Is expressed in the active form, there 
is very little processing needed to identify the subject and the object, 
since the surface structure is the same as the deep structure. When a 
clause is in the negative form (still active), the subject— obj ect relation 
has not changed from the affirmative form, and for the present task, the 
negation is not expected to affect the choice of the pronominal referent. 
However, when a clause is in the passive form, the subject-object relation 
in the surface structure is just the opposite of that in the deep struc- 
ture. Hence a certain amount of processing is needed to obtain the rela- 
tionship between the logical subject and the logical object. The k in 
the kT reflects not only the length of the clause but also the "program" 
which obtains this relationship of the sentence. Thus, although the 
length of a clause is the same no matter which class of verb-pairs is 
used, the "program" for the real verb-pairs could be shorter than for the 
nonsense verb— pairs. Therefore, given a certain decoding capacity k 
for a _S, if the length of the clause and the demands of the processing 
program exceed this k, then that clause will be erroneously decoded. 

We do not know how the ^ stored the decoded message, but the subject-object 
relationship and the particular verb used must be clearly stated. Since 
each sentence consists of two clauses, the decoding process can proceed on 
one clause at a time. For example, the four grammatical forms in the first 
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clause may be decoded as follows: 



John Yed Bill 
John did not Y Bill 
John was Yed by Bill 



-»■ Yed [John, Bill] 

-»■ not Yed [John, Bill] 

-»■ Yed [Bill, John] 



John was not Yed by Bill not Yed [Bill, John] 

The decoded message shows clearly the logical-subject and the logical- 

object of the clause within the square bracket. For the second clause, 

however, we need only know the logical function (subject or object in 

the deep structure) for the pronoun he and the type of sentence. Hence 

the eight different clauses may be decoded as follows: 

he Xed him (back) ->■ Xed (back) [subject] 

he did not X him (back) -*• not Xed (back) [subject] 

he was Xed (back) by him Xed (back) [object] 

he was not Xed (back) by him not Xed (back) [object] 

For example, the sentence 

John Yed Bill but he was not Xed back by him 

has the following as its decoded message, 

Yed [John, Bill] but not Xed back [object]. 

The decoded message is then fed to a second transducer k’T where 
the rules of English usage are applied to assign an output, either John 
or Bill . The k* reflects the maximum length of the decoded message plus 
a "program" which assigns the pronominal referent. Again It is necessary 
to postulate an assignment "program" which is concatenated with the decoded 
message, because the difficulty in making the assignment varies with the 
class of verb-pairs and the sentence type. Thus the level of difficulty 
is reflected in the length of the "program" for making the assignment. 
Hence, if two given sentences can be easily decoded but one is more dif- 
ficult to assign than the other, then we can expect that the more difficult 
sentence will not receive a uniform response from a group of ^s. In terms 
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of the k-l±mited transducer model we say that both sentences are within 
the k of the decoding transducer, but the more difficult one has ex- 
ceef'ed the k’ of the assignment transducer due to the longer assign- 
ment "program”. 

There is some behavioral evidence in support of including the "pro- 
gram" as part of the input string of symbols. Mehler [1963] and Mehler 
and Miller [1964] found that when recalling English sentences, the most 
frequent errors were of a syntactic type, i.e., errors of recalling a 
grammatical form different from the one given. The authors suggested that 
the _Ss decoded the given sentence into its kernel form and tagged it with 
the necessary transformational rules. Savin and Perchonock [1965] per- 
formed an experiment in which the £ was presented with a sentence followed 
by a sequence of eight unrelated words. The task was to recall the sen- 
tence verbatim and as many of the unrelated words as possible. Then an- 
other sentence and a sequence of eight different words was presented to 
the S_ for recall. The sentences differed from each other by a number of 
grammatical transformations, e.g., the passive, the question, the negation, 
the emphasis, and the wh transformations. The hypothesis was that if the 
^s were to decode each sentence into its kernel form plus tags indicating 
the necessary transformations, then for a given memory capacity the more 
complex sentences would take up more "space", and as a consequence there 
would be less "space" to remember the additional unrelated words. Indeed, 
the number of additional words recalled correctly was greatest for the 
kernel sentences, and decreased for those that had more transformations. 

In other words, the "program" for encoding the decoded kernel sentences 
was longer for the more syntactically complex sentences, i.e., those with 
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more transformations. 



The experimental results of Epstein [1961] and Rosenberg [1966] sup- 
port the concept of including a "program" as inputs to the assignment 
k-limited transducer* Epstein found that semantically anomalous, yet 
grammatical, sentences were learned faster than the same sentences with 
nonsense words. Rosenberg found that for both grammatical and ungrammat- 
ical sentences, there was a significant difference in recalling the sen- 
tences when the association levels were changed for the content words* In 
the present experiment, we have four classes of verb-pairs for each sen- 
tence type* Thus we would expect the sentences with nonsense verb-pairs 
to be the most difficult, those with identical verb-pairs to be the eas- 
iest, and those with either the similar verb-pairs or logical reversal 
verb-pairs to fall somewhere in between* To characterize this level of 
difficulty, which is not simply the length of sentences, we hypothesize 
an "assignment program" in parallel to that of a "decoding program"* 

4*3 Information Transmission Model 

An information transmission model may be diagrammed as in Figure 4-2* 
The input sources A, B, * * * , N have A, B, * * * , N symbols re- 
spectively, and the output source Z has Z symbols* The symbols from 
the input sources are encoded into a message which is transmitted to the 
destination. At the destination, the received message is decoded to pro- 
duce an output symbol* In the transmission process, there is a certain 
amount of noise introduced which makes it impossible to predict the out- 
put symbol with complete accuracy from the input symbols. 

We can interpret the task of each ^ through the information trans- 
mission model in the following way* There are three input sources. A, 

B and C* Source A has four symbols A, A, P and P which denote the 
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grammatical form of the first clause of a sentence; source B also has 
four symbols A, A, P and P which denote the grammatical form of the 
second clause of the sentence; source C has two symbols M and F 
which denote the sex of the ^ who is responding to the sentence. Each 
sentence of a sentence frame is thus an encoded message from these three 




Figure 4-2; Diagram of an information transmission model. 

input sources. The encoded message is read and decoded in the brain from 
which an output symbol is produced. The output source consists of two 
symbols, John and Bill or subject and object or 0 and 1. In the pro- 
cess of transmission from the input symbols to the output symbols, a cer- 
tain amount of noise is produced so that the output ssnnbols are not com- 
pletely dependent on the input symbols. In the present experiment, many 
different factors could have contributed to the non-uniformity of the Ss» 
responses (l.e., the noise component), among which are mistakes in decoding 
sentences, nonequivalent linguistic knowledge among the^s, insufficient 
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cues in the sentence structure (as in the sentences from the Each other 
type) , or insufficient memory capacity to perform this task (as in the 
fifth grade group) . Such internally generated noise is not controllable, 
but there may still be sufficient dependency between the input and the 
output sources to allow a prediction of the output symbols that is sig- 
nificantly better than chance. 

Other variables, such as the verb-pairs, the sentence types and the 
age groups could have been included as input sources. But since the mea- 
sure of "information", which is defined later, is the average value over 
a random variable, it would be meaningless to calculate such an average 
value when we expect very different responses among the elements of the 
random variable. For example, we expect that the Each other sentence 
type would elicit random responding while the other two sentence types 
would elicit uniform responding, especially for the older groups. Hence 
by averaging the responses from the three sentence types, we lose all the 
distinctions among the sentence types. Similarly, because the logical 
reversal verb-pairs are expected to elicit responses opposite (i.e., re- 
versed) from those of the identical verb-pairs , that distinction would 
also be lost if the verb-pairs variable were averaged. Also, we expect 
the different age groups to respond differently so that an average response 
from the four age groups would not represent any population group. How- 
ever, when we treat each sentence frame as a unit, the 16 sentences are 
different ways of expressing the same idea. Therefore, the average of 
these 16 responses (represented by the two variables A and B) is the 
measure of the degree of ambiguity of that sentence frame. The sex is 
taken as a variable because we want an average response for each age group, 
especially when the male and the female responses do not differ signifi- 
ERJC cantly. 
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Before we turn to a mathematical derivation of a 3-input source and 
1-output source information transmission model, we will give an informal 
outline of the procedure and the important concepts that are involved. 

The derivation begins with a 1-input source and 1-output source model which 
is exemplified by a 2-dimensional table where the rows correspond to the 
input symbols and the columns correspond to the output symbols. Each cell 
in the table corresponds to the event that the input symbol as designated 
by the row has elicited the output symbol as designated by the column. 

The number in the cell denotes the frequency with which that event has 
occurred. With such a contingency table, the probabilities are estimated 
from the cell values so that Shannon’s information measure can be applied, 
one for the output source H(Z) , one for the input source H(A) , and one 
for the joint events H(A,Z). Then the transmitted inforn..f -f nn t(A;Z) 
is defined between the input source A and the output source Z as 
T(A;Z) * H(A) + H(Z) - H(A,Z). 

The term T(A;Z) is a measure of the dependency between the output and the 
input sources, and it varies from zero to a maximum value of H(Z). If it 
is zero, then the output source is independent of the input source; that 
IS, any knowledge of the input symbols does not help to predict the output 
symbols. If T(A;Z) = H(Z) , then the output source is completely deter- 
mined by the input source; that is, every output symbol is uniquely deter- 
mined by an input symbol. Another equivalent relationship for the trans- 
mitted information term is 

T(A;Z) = H(Z) - H^(Z), 

where H^(Z) = H(A,Z) - h(A) is the noise term. Hence the case of a com- 
plete dependency between the input and the output sources (i.e., when 

T(A,Z) - H(Z)) is equivalent to a noiseless communication system, and the 
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case of no dependency between the input and the output sources (i.e., 

when T(A;Z) = 0) is equivalent to a communication system that is a noise 
generator. 

The usual case is, however, for T(A;Z) to be between zero and 
H(Z) , which reflects some dependency between the input and the output 
sources. Fortunately there is a test of significance by chi-square val- 
ues for T(A;Z) for the hypothesis of no dependency. That is, given a 
certain probability (either .01 or .001) of rejecting the hypothesis 
of no dependency, we can determine whether the obtained T(A;Z) is sig- 
nificant (when p is chosen as .01) or "highly" significant (when p 
is chosen as .001). The significant and the "highly" significant T-terms 
imply that the output symbols can be predicted from the input symbols bet- 
ter than chance. 

From the 1-input source and 1-output source model, we then move to 
the 2-input source and 1-output source model (3-dimensional contingency 
tables) and finally to the 3— input source and 1— output source model 
(4-dimensional contingency tables). The concepts involved are, however, 
similar to those of the 1- input source and 1-output source model. Since 
there is now more than one input source, there is more than one T-term, 
each of which measures the dependency between the input and the output 
sources. The chi-square distribution provides a test of the significance 
for each of these terms. For the 3-input sources A, B and C and the 
1-output source Z, T(ABC;Z) is the transmitted information between the 
joint input symbols from A, B and C and the output symbols from Z. 
When we reduce the 4-dimensional contingency tables to 3-dimensional con- 
tingency tables by summing over one input source, say, source C, then 

T(AB;Z) is the transmitted information between the joint input symbols 
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from A and B and the output symbol from Z. For the present experi- 
ment, T(AB;Z) corresponds to the transmitted information between AB 
and Z when the responses by the male and the female Ss are pooled to- 
gether. When we treat the male responses and the female responses sep- 
arately, we have the T(AB;Z)/M and T(AB;Z)/F respectively. Then by 
further reducing the contingency tables to 2-dimensions, we have the terms 
T(A,Z), T(B;Z) and T(C;Z) which are the T-terms between one input 
source (summed over the other two sources) and the output source Z. 

In the 3-input source and 1-output source, besides the T-terms, 
there are four 3-way interaction terms and one 4-way interaction term. 

The 3-way interaction terms measure the amount of transmitted information 
that is gained (when positive) or lost (when negative) between any two 
sources when the third source is known. Likewise for the 4-way inter- 
action term, except now it has a fourth source. 

There is a very close relationship between the information trans- 
mission model and that of analysis of variance. The interested reader 
is referred to Gamer and McGill [1956]. In the remaining part of this 
section, we will define the information transmission model for a 3-input 
source and 1-output source and define formally such terms as "informa- 
tion", "transmitted information" and "interaction". The formulae thus 
derived were used in a computer program, which was written in the MAD 
language, to take as inputs the numbers in contingency tables and print 

out the different T-terms, along with their chi-square values and the 
interaction terms. 

The information transmission model may best be illustrated by con- 
sidering only one input source A and one output source Z. Let the in- 
put have A symbols, with probability p(a) that the a^^ 
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A 

is sent, where Z p(a) = I p(a) = 1; let the output have Z symbols, 

Si 

til 

with probability p(z) that the z output s3nnbol is received, where 

Z 4-^ 

p(z) = 2 p(z) = 1; and let the joint probability that the a in- 

til 

put symbol is sent with the z output symbol received be p(a,z), 

where Z p(a,z) = p(a), Z p(a,z) = p(z) and Z p(a,z) = 1. Hence we 

Z Si SiyZ 

can use Shannon's information measure as follows: 

H(A) = -Z p(a) log p(a) 

ci 

H(Z) = -Z p(z) log p(z) 

Z 

H(A,Z) = - Z p(a,z) log p(a,z). 

a ^ Z 

The logarithm will be taken to the base two; hence the units for the 
H-terms will be in bits. 

If the input and the output sources are independent (i.e., knowledge 
of the input symbol does not change the probability distribution of the 
output symbol) then p(a,z) = p(a) p(z); hence, 

H(A,Z) * - Z p(a,z) log p(a,z) 

SiyZ 

* "a?z [p(a) p(z)] 

P(a,z) [log p(a) + log p(z)] 

" " a z P(a) - I I p(a,z) log p(z) 

= - Z p(a) log p(a) - Z p(z) log p(z) 

ci Z 

« H(A) + H(Z). 

However if the output symbol is uniquely determined by the input symbol, 
then the joint probability is equal to the input source probability, i.e., 
p(a,z) * p(a). Hence H(A,Z) reduces to H(A). In general, whenever 
the input and the output sources are not independent, then H(A,Z) < H(A) + 
H(Z). 

We define the transmitted information T(A;Z) between the input 

^ source A and the output source Z as, 

ERIC 

““ T(A;Z) « H(A) + H(Z) - H(A,Z). 
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T(A;Z) is a term which measures the dependency or predictability of 
output symbols from the input symbols* It varies from zero for no de- 
pendency, to a maximum value of H(Z) for complete dependency. From 
probability theory we know that the joint probability p(a,z) is re- 
lated to the conditional probability p (z) and p(a) by the equation, 
p(a,z) « p(a) p^(z) . 

cL 

Therefore we define a conditional information term H. (Z) as, 

H^CZ) = p(a,z) log p^(z) 

= P(a,z) log-E^ 



= [p(a,z) log p(a,z) - p(a,z) log p(a)] 

“ p(a,z) + 2 p(a) log p(a) 

» j « a 

= H(A,Z) - H(A). 



H^(Z) denotes the information or uncertainty of the output symbol when 
the input symbol is known; that is, it is a measure of the noise that is 
generated in an information transmission model or communication system. 
For the present experiment, the factors which contributed to the non- 
uniform responses from the Sa make up the noise component. When we sub- 
stitute this term into the equation for T(A;Z), then we obtain the fol- 
lowing relation; 



T(A;Z) ^ H(Z) - H^(Z). 

In other words, the transmitted information is the difference between the 
information in the output source and the noise component. Thus the case 
of no dependency between the input and the output sources, i.e., when 
T(A;Z) * 0, is equivalent to a communication system which generates noise 
(i.e., a random generator). The case of complete dependency between the 
input and the output sources, i.e., when T(A;Z) * H(Z) , is equivalent to 
a noiseless communication system (i.e., one where each output symbol is 
uniquely determined by an input symbol) . 
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In an experimental situation, these probabilities p(a), p(z) and 
p(a»z) can be estimated from the observed data, in the form of contin- 
gency tables. In this 2~dimenslonal case, the tables are matrices with 

S5 

A rows and Z columns. Each cell (a,z) contains the number n(a,z) 

which corresponds to the event when the a input symbol is sent out 
ttl 

and z output symbol received. Then the number of observations of the 
ttl 

a input symbol, n(a), regardless of what is received, is n(a) * 

I n(a,z)$ i.e., the row sums. Similarly the number of observations of 
ttl 

the z output symbol, n(z) , regardless of what symbol is sent out is 
n(z) « I n(a,z); i.e., the column sums. Similarly, 

n « Z n(z) « Z n(a) « Z n(a,z). 

^ cl cLyZ 

The maximum likelihood estimates of p(a), p(z) and p(a,z) [Mood, 1950, 
chapter 8] are, 

p(a) « n(a)/n 
p(z) = n(z)/n 
p(a,z) « n(a,z)/n. 

Based on this estimate of the probability terms, we can calculate the 
various information terms. 

In order to facilitate the manipulation of the estimated probabil- 
ities, let us introduce the following notation; 

Saz “ n a^z n(a,z) 

®a " n a 

®z * n z 

S * log n 

Then, 

er|c 
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H(A) = p(a) log p(a) 

= I n(a) log n(a) + “ | n(a) log n 

“ £ n(a) log n(a) + log n 

= S - S . 

d. 

Similarly , 

H(Z) = S * S 

2 

H(A,Z) « S - S . 

az 

Therefore, 

T(A;Z) « H(A) + H(Z) - H(A,Z) 

*S-S - S +S . 
a 2 az 

Now suppose there are two input sources A and B with A and 

S3nnbols respectively and an output source Z with Z symbols. Let the 
til 

a input symbol from source A have a probability p(a) of occurring 
A , 

where p(a) « | p(a) « 1; let the b input symbol from source B 

B 

have a probability p(b) of occurring where p(b) = g p(b) * 1; and 

til 

3.et the z output symbol have a probability p(z) of occurring where 

1 

information measures, in addition to those 

already defined, are: 

H(B) « p(b) log p(b) 

H(A,B) as p(a,b) log p(a,b) 

H(B,Z) a -^2^ p(b,z) log p(b,z) 

H(A,B,Z) a p(a,b,z) 









p(a,b,z) log p^^(z) a H(A,B,Z) - H(A,B) 
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For the transmitted information term, we are now considering the in- 
put as two separate sources; hence, 

T(AB;Z) « H(A,B) + H(Z) - H(A,B,Z) 



= S- S,-S + S^, 
ab z abz 



where 



1 y ' 

®abz “ n n(a,b,z) 

®ab “ n afb ”(“>*>)• 

Equivalently, let H^g(Z) be the noise generated in a 2-input and 1-output 
information transmission model, then 
T(AB;Z) « H(Z) - \g(Z). 

The number n(a,b,z) is the number of observations in cell (a,b,z) of 

a 3-dimensional contingency table and it corresponds to the event when 

the a^^ and b^^ symbols from the respective A and B input sources 

till 

are sent out and the z output symbol is received, out of a total of 
n observations* The number n(a,b) is the number of observations in 

(s,b) summed over the Z— dimension and it corresponds to the event 
when the a^^ and b^^ symbols are sent out regardless of which symbol 
is received. Other S- terms for the 3-dimensional contingency tables will 
appear later and are assumed to be defined in similar ways. 

Of the three sources, one can eliminate any one of them to consider 
transmitted information between the other two sources. Suppose we elim- 
inate the source B, and consider the term T(A;Z). There are two ways 
in which the source B may be eliminated. One way is to sum over the 
B-dimension of the 3-dimensional contingency table, effectively reducing 
it to an A by Z 2-dimensional table, i.e., n(a,z) « g n(a,b,z). 
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Hence , 



T(A;Z) = H(A) + H(Z) - H(A,Z) 



*S-S - S + S . 
a z az 



The other way is to compute the information transmission between A and 
Z separately for each symbol b from source B and then average these 
together. Hence, 

T (A;Z) - E [T, (A;Z)] 

° b n b 



* Z 



nCb) 



b n 



» 2 qla.»b). - iLCa^b) n(b,z) 



n(b) z n(b) 



n(z) 



. y n n(a,b ,z) 

^ a,z n(b) 



- - i .h, •<••» 1"« *”« 



4. i 2ZL nCa b z^ loff -S-Catba-^). 
^ n a,b,z n(b) 



* S, - S , + S, - S, “ S, + S , 
b ab b bz b abz 



“ S, — S , — S, + S , , 

b ab bz abz 

Similarly, if we were to eliminate the source A instead of B, we have 

T(B;Z) = S - S, - S + 

b z bz 



Ta(BjZ) - 




+ s 



abz * 



We could equally eliminate the output source Z entirely and consider 

1 

the transmission between A and B; hence. 



It is conventional to speak of transmission from input sources to out- 
put sources. However, the underlying contingency table for a communica- 
tion system does not distinguish between input and output sources, since 
each source is a dimension in the table. Hence we can eliminate any one 
or more sources by summing over the corresponding dimensions in the table. 
When the output source Z is eliminated in this case, we could consider 
temporarily, either source A or source B as the output source. Be- 
cause, by design, we have all combinations of the four grammatical forms 
for both the first clause (source A) and the second clause (source B) , 
T(A;B) * 0. 
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T(A)B) ■ S “ S - S, + S , 

a b ab 

T(A;B)*S “S — S, +S,» 

^ z az bz abz 

The difference between the two ways of eliminating any once source 
is defined as the 3-way interaction information A (ABZ) . It measures 
the amount of transmitted information that is gained (when positive) or 
lost (when negative) between any two sources when the third source is 
also known* If A (ABZ) is zero^ it implies that one of the three sources 
is superfluous, and therefore the 3-dimensional contingency table may be 
reduced to a 2-dimensional contingency table without any loss of general- 
turns out that s'his interaction term is the same no matter which 
particular source is eliminated. Thus, 

A(ABZ) - Tg(A;Z) - T(A;Z) 

« T^(B;Z) - T(B;Z) 

- T2(A;B) - T(A;B) 

--S + S +S. + S - S.-S -S^ + . 

a b z ab az bz abz 

Finally we can relate the 3-dimensional information transmission 
T(AB;Z) with the 2-dimensional information transmission terms, as follows: 
T(AB;Z) - T(A;Z) + T(B;Z) + A(ABZ) 

‘t.. r - * 

- T^(A;Z) + T^(B;Z) - A(ABZ) . 

The Venn diagram in Figure 4-3 also shows the relationship when A(ABZ) 
is positive. 



As our experimental results are best analyzed one frame at a time 
by a 3— input source and 1— output source model, we need to extend the cal- 
culations to include a third input source C. It has C symbols, each 

H 

c 

with a probability p(c) of occurring with p(c) « g p(c) » 1. The 
data will be in 4-dimensional contingency tables where the cell (a,b,c,z) 
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has a number n(a,b,c,z) which corresponds to the number of observations 

^ ^ till tli 

of the event that the 3 / , b^ and c symbols from input sources A, 

til 

B and C respectively are sent out with the z output symbol received. 




Figure 4~3; Components of transmitted information 
in a 2-input and 1-output model. 



ERIC 



When the terms n(a,b,c,z) are summed over any dimension, e.g., dimen* 
Sion A, then the n-terms are denoted as 
n(b,c,z) « E n(a,b,c,z). 

In general, the summed terms are denoted as follows, 

_ 

n * E n(a) *» E. n(a,b) ■ 4-w n(a,b,c) * - n(a,b,c,z). 

SL a|DfC 

The S-terms are defined in a similar way as before, e.g., 



I 



abcz n a,b ,c, z 



n(a,b,c,z) log n(a,b,c,z) 



’abc 



n a^^ n(a,b,c) log n(a,b,c). 



The H-terms are also defined in similar ways as before, e.g., 
H(A,B,C,Z) » - \ ^ p(a,b,c,z) log p(a,b,c,z) 

H ^ D ^ C I Z 



49 







ERIC 






3 



H(A,B,C) 



EL 



a,b, c 



p(a,b,c) log p(a,b,c) 



“abc<"> 



J-' 

- afez Pabc^^> 



* H(A,B,C,Z) - H(A,B,C). 



For this 3-input and 1-output information transmission model, we de- 
fine the transmitted information T(ABC;Z) between the three input 



sources and one output source as follows, 

T(ABC;Z) = H(A,B,C) + H(Z) - H(A,B,C,Z) 



= S- S, -S + S, . 
abc z abcz 



Equivalently, in terms of the noise term 



T(ABC;Z) = H(Z) - H^g^(Z) . 



There are now four 3-way interaction terms, A(ABZ) , A(ACZ) , A(BCZ) and 
A (ABC) , and a 4-way interaction term A (ABCZ) . This 4-way interaction 
term denotes the amount of transmitted information gained (or lost) in 



transmission by controlling a fourth source when any three of the other 
sources are known. Some of the relationships of the 4-dimensional trans- 



mitted information to the 3-dimensional and 2-dimensional transmitted 



information terms are as follows, 

T(ABC;Z) * T(A;Z) -f- T(B;Z) + T(C;Z) -1- A(ABZ) -1- A(ACZ) + A(BCZ) -1- A(ABCZ) 



T(AB;Z) + T(C;Z) + A(ACZ) + A(BCZ) -1- A(ABCZ) 



- T^(AB;Z) T(C;Z) 



* T(A;Z) + T(B;Z) + T^g(C;Z) -1- A(ABZ) . 



The Venn diagram in Figure 4-4 also depicts the various components 
that make up the output information H(Z) . 

For each of the T-terms we can derive the large sample distribution 



by the likelihood ratio test. Suppose in the 3-input and 1-output infor- 



mation transmission model, we want to test the hypothesis that the out- 



put source is independent of the input sources, i.e,. 



p(a,b,c,z) * p(a,b,c) p(z) 
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These probabilities are estimated from sample data in the form of con- 
tingency tables by the maximum likelihood estimator: 
p(a,b,c,z) * n(a,b,c,z)/n 
p(a,b,c) = n(a,b,c)/n 
p(z) = n(z)/n. 




H(Z) 



Figure 4-4: Components of transmitted information 

in a 3-input and 1-output model. 



Therefore the independence of input and output sources can be tested by 
the likelihood ratio X [Wilks, 1962, Chapter 13; Mood, 1950, chapter 12]: 



a,D,c z 



X * 



a.b.L.z 
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-2n I I / , vn(a,b,c) „ . v^Cz) 

" a'b.o i 



-n 1 r f . .n(a,b,c,z) 

XI V • XlCd«D«C«^/ 

a,b,c,2 

If we take the logarithm to the base two on both sides, we obtain, 

log X = >-2n log n + n(a,b,c) log n(a,b,c) + I n(z) log n(z) 

® ® a,D,c z 



n log n + ^ n(a,b,c,z) log n(a,b,c,z) 

a 9 D 9 c ^ z 



= n 



V” I 



c) log n(a,b,c) + ~ | n(z) log n(z) 

XI z 



„ 1. 2^ I t n(a,b,c,z) log n(a,b,c,z) 

n a,b,c,z 

“ 

* -n T(ABC;Z). 

From the theory of large sample distributions, it is known [e.g., Wilks, 
1960, Chapter 13, p. 419] that the quantity -2 In X has an asymptotic 
chi-square distribution with (Sc - 1) (Z - 1) degrees of freedom, 
where In is the natural logarithm and A, 1 , C and Z are the number 

of symbols in the sources A, B, C and Z respectively. Therefore, by 

a change of base for the logarithms, where 2 In 2 = 1.3863, we obtain 
the final expression: 

-2 In X « -2 In 2 log X 

- 1.3863 n T(ABC;Z). 

Similar expressions may be derived for other T-terms. Thus we obtain the 
value for any T-term by multiplying the T-term by 1.3863 n where n 

is the sum of all the terms in a contingency table or the total number of 

responses for a given sentence frame and a group of £s. Therefore if the 
hypothesis 

p(a,b,c,z) a p(a,b,c) p(z) 
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is true, then 1.3863 n T(ABC;Z) is distributed as x^(32) (chi-square 

for 32 degrees of freedom) . For a given choice of probability level 

a, e.g., .01 or .001, we can find the critical points x (for p = .01) 

and x' (for p = .001) such that if the quantity (1.3863 n T(ABC;Z)) > x, 

then the hypothesis is unreasonable, or if (1.3863 n T(ABC;Z)) > x* , the 
hypothesis is highly" unreasonable. When the hypothesis is (or is 
"highly") unreasonable, we can suspect some dependency between the input 
and output sources, or more informally, we suspect that the ^s are re- 
sponding to the cues in the sentences so as to produce a more uniform 
choice of the pronominal referent. 



Chapter 5 



RESULTS 



This chapter describes the results of the experiments with the four 
age groups, from the college sophomore group to the fifth grade group. 

For each age group, the results are analyzed by the proposed three mathe- 
matical models: the Bernoulli trials model, the k-limited transducer 

model and the information transmission model; the presentation of the re- 
sults is followed by an informal discussion which summarizes the findings. 

5.1 The Colleige Sophomore Group 

The basic data for the college sophomore group appear in Tables 5-1, 
5-2, and 5-3. There were 21 male £s with an average age of 19.6 years 
and 20 female _S^s also with an average age of 19.6 years. They were 
students at the University of Michigan and served as paid ^s. Table 5-1 
contains the results of three sentence frames of the sentence type Each 
other ; Table 5-2 contains the results of four sentence frames of the sen- 
tence type He Xed him back ; and Table 5-3 contains the results of four 
sentence frames of the sentence type He Xed him . The results are tabulated 
separately for the male and the female ^s and their pooled data are denoted 
as ’’Total”. The difference between the male and female responses for each 
sentence frame is calculated as a chi-square value. The x (16) (chi- 

square values for 16 degrees of freedom) for the different sentence frames 

2 

appear in the last column (x ) in Table 5-5. An asterisk next to the 
X^(16) denotes that it has exceeded the .01 significance level. Each 
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4x4 matrix represents the 16 possible combinations of sentences of 
a frame where the rows indicate the grammatical forms (A, A, P or P) of 
the first clause and the columns indicate the grammatical forms (A, A, P 
or P) of the second clause. The number in each cell is the number of ^s 
who responded with the grammatical object-slot for that sentence. For 
example, in Table 5-1, the top-left matrix is the response matrix for 21 
male ^s to the nonsense Each other frame. The cell A-A corresponds to 
the sentence John and Bill Yed each other and he Xed him with Y and X 
being nonsense verbs; its cell number 4 indicates that four male Ss re- 
sponded with the grammatical object-slot (Bill) as the referent of he . 

Hence 21 - 4 = 17 male responded with the grammatical subject -slot 
(John) as the referent of he . (Please note that for ease of usage, the 
first-mentioned person (John) is defined as the subject while the second- 
mentioned person (Bill) is defined as the object). 

a. Analysis by the Bernoulli trials model . The maximum likelihood 
estimator p for each sentence frame (i.e. , a matrix) appears at the 
bottom of the matrices. Next to the value of p is the confidence inter- 
val [u,v] where the tail probability a is chosen as .01. Hence by 
assuming a Bernoulli trials model, i.e., all the n ^s selected indepen- 
dently, and with the same probability p of selecting the grammatical 
object-slot as the referent of the pronoun, the probability of obtaining 
Y object-slots is < .01, if Y ^ u or Y ^ y. Those cells whose num- 
bers are ^ u or > v are starred. The sentences corresponding to these 

2 

cells are Interpreted as the unambiguous sentences. Since all the x (1^) 
values for the differences between the male and the female responses are 
not significant, subsequent discussion will pertain to the "Totai’ matrices, 
i.e., the pooled data. 
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Firsts all the sentences of the type Each other in Table 5-1 are 
not starred, and hence they are interpreted as ambiguous, which is what 
we had expected. Apparently whenever the ^s encountered sentences of 

other type, they realized the impossibility of deciding the re- 
ferent, hence approximately three quarters of them just marked an "x" 
on top of the first word (the first word is also the first-mentioned per- 
son, which, for the present study, is denoted as the subject). The gram- 
matical forms of the two clauses and the different verb-pairs did not 
alter the predominantly subject-slot responses. This bias toward the 
subject-slot is reflected by the very low value for the maximum likeli- 
hood estimator p; and as a consequence, the confidence interval would 
cover all the 16 responses in each sentence frame. 

Even in the extreme, if all except one or two £s (out of 41) se- 
lected the subject-slot response (i.e. , the first word) in all 16 sen- 

tences of a frame, it might appear that all 16 sentences should be inter- 
preted as unambiguous. But such biased responses would lead to an estimated 
^ of very small value, and a 98% confidence interval that is certain to 
Include one and two. Thus according to the Bernoulli trials model, all 
these sentences would be considered as ambiguous, since their responses 
can be predicted without regard to the structure (i.e., the grammatical 
forms of the two clauses) of the sentence. Otherwise we are forced to con- 
sider both the following two sentences of different grammatical structure 
as having the same unambiguous Interpretation, i.e., ^ - John :. 

John and Bill Yed each other and he Xed him. [A-A] 

John and Bill Yed each other and he was Xed by him. [A-P] 

But this leads to an explicit contradiction, i.e., if ”he « John” is ac- 
cepted as the correct interpretation for one sentence, then * John” 
cannot be accepted as the correct interpretation for the other sentence. 
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because the passive transforaation in the second clause must interchange 
the referents of he and him. The explicit contradiction would be much 
more clear in the following pair of sentences of the same grammatical struc- 
ture as before, but taken from the He Xed him back frames 

John Yed Bill and he Xed him back. [A-A] 

John Yed Bill and he was Xed back by him. [A-P] 

It seems reasonable to require that the set of unambiguously interpreted 
sentences does not contain explicit contradictions. Thus if "he “ 
is the acceptable interpretation for one of these sentences, then "he = 

John” cannot be the acceptable interpretation for the other sentence. Sim- 
ilarly, the interpretation "he ® Bill" cannot be acceptable for both sen- 
tences. Hence, even though the group response may be uniform, they are 
made without regard for the grammatical structure of the sentence, and those 

sentences must be classified as ambiguous. 

Now for the responses from the sentence type He Xed him back , which 
appear in Table 5-2, almost all the sentences are unambiguously interpreted. 
Contrary to the Each other sentence type, the grammatical forms of the two 
clauses largely determine the referents in this sentence type. 

The word "back" was intended to switch the actors of the two actions, 
i.e., the subject of the first clause becomes the object of the second 
clause or the object of the first clause becomes the subject of the second 
clause. However, in a passive transformation, the grammatical subject (in 
the surface structure) becomes the logical object (in the deep structure), 
and likewise, the grammatical object becomes the logical subject. There- 
fore the switch of subject and object must be done after the two clauses 
are transformed into their respective deep structures, i.e., we are switch- 
ing the logical subject and the logical object. Since the negative con- 
struction does not alter the relationship between the subject and the object. 
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the referent of lie is expected to be invariant under the negative trans- 
formation. Hence with respect to the matrices, we would expect the four 
top-left cells (A-A, A-A, A-A and A-A) to have the same referent, the 
four top-right cells (A-P, A-P, A-P and A-P) to have the same referent 
(though not the same as the top- left ones) , the four bottom-left cells 
(P-A, P-A, P-A and P-A) to have the same referent, and finally the four 
bottom-right cells (P-P, P-P, P-P and P-P) to have the same referent. In 
addition, since the passive transformation effectively interchanges the 
subject-object relationship, its effect on the pronominal referent is ex- 
pected to be the same whether the passive transformation occurs in the 
first or the second clause. Therefore we would expect the eight cells in 
the minor diagonal (i.e., the four top-right cells and the four bottom- 
left cells) to have the same referent. Likewise, when both clauses have 
the passive transformation, as in the four bottom-right cells, we would 
expect the two interchanges of the subject-object relationship to cancel 
out. Hence the referent for the eight cells in the major diagonal (i.e., 
the four top- left cells and the four bottom-right cells) is expected to be 
the same. 

Except for the logical reversal frame, the response matrices of the 
other three sentence frames can be partitioned into four corner submatrices, 
each with a relatively homogeneous selection of the pronominal referent. 

Such partition of the total matrices confirms the expectation that the pro- 
nominal referent is invariant under the negative transformation. Further, 
the effect of switching the subject-object relationship by the passive trans- 
formation is confirmed by the observed "diagonal" response, i.e., the cells 
of the major diagonal have the same pronominal referent and the cells of 
the minor diagonal have the same pronominal referent (but different from 
the major diagonal). Hence it is reasonable to assume that the ^s based 

o 
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their selection on the deep structure rather than the surface structure 
of the sentences. The fact that the cells of the major diagonal have un- 
ambiguous object-slot responses (the numbers are ^ v) and the cells of 
the minor diagonal have unambiguous subject-slot responses (the numbers 
are ^ u) supports the hypothesis that the intended meaning of "back" was 
indeed used by the ^ in selecting the referent for the pronoun he . The 
intended meaning was to interchange the actors in the two clauses, i.e. , 
the logical subject of the first clause becomes the logical object of the 
second clause and the logical object of the first clause becomes the logical 
subject of the second clause. 

The four top-left cells correspond to sentences with both clauses in 
the active forms. The pronoun h^ is the logical subject of the second 
clause, hence its expected referent would be the logical object of the first 
clause. Since the first clause is in the active form, its logical object 
is the grammatical object. The four bottom— right cells correspond to sen- 
tences with both clauses in the passive forms. Here the pronoun hje is the 
logical object of the second clause, hence its expected referent would be 
the logical subject of the first clause. Since the first clause is in the 
passive form, its logical subject is the grammatical object. These eight 
cells of the major diagonal, with their cell numbers ^ v, are unambiguously 
interpreted in the object-slot. Thus the cells in the major diagonal are 
in agreement with the expected object-slot responses. The four top-right 
cells correspond to sentences whose first clause is in the active and the 
second in the passive. The pronoun is the logical object of the second 
clause, hence its expected referent would be the logical subject of the 
first clause. Since the first clause is active, its logical subject is its 
grammatical subject. The four bottom-left cells correspond to sentences 
whose first clause is passive and its second clause active. Here the pronoun 
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he is the logical subject in the second clause, hence its expected refer- 
ent would be the logical object of the first clause. Since the first clause 
is in the passive form, its logical object is its grammatical subject. 

These eight cells of the minor diagonal, with the cell numbers < u, are 
unambiguously interpreted in the subject-slot. Thus the cells in the minor 
diagonal are also in agreement with the expected subject-slot responses. 
There are five sentences in the nonsense frame and one sentence in the sim- 
ilar frame which did not attain the .01 significance level, and hence 
they remained as ambiguous sentences. These belong to the more syntacti- 
cally complex sentences which possibly exceeded some ^s* capacity for sen- 
tence processing. These will be taken up in the following section on the 
k-limited transducer model. 

The results for the logical reversal frame are more difficult to ex- 
plain. There is still the approximate partition of the matrix into four 
comer submatrices, but in each submatrix, there is always one cell number 
which is far different from the other three cell numbers. Again the "diag- 
i^fisponse is observed, save for one cell in each submatrix. Finally, 
the intended response reversal due to the use of logical reversal verbs is 
obtained, ^.e., the cells of the major diagonal are unambiguously inter- 
preted in the subject-slot while the cells of the minor diagonal are unam- 
biguously interpreted in the object-slot. (Recall that for the other three 
sentence frames, the cells of the major diagonal are unambiguously inter- 
preted in the object-slot while the cells of the minor diagonal are unam- 
biguously interpreted in the subject-slot). The cells which differ greatly 
from the other three cells in each submatrix are A-A, A-P, F-A and P-P. 
These "deviant" cells correspond to sentences with the first clause expressed 
in the negative (A or P) while the second clause is expressed in the af- 
firmative (A or P) . The conjunction used is always but . However, the 
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deviance cannot be attributed to the use of but, since there are 
three other constructions using the conjunction but « A-A, P-A and 
P-P, and yet their responses were unambiguous. Nor can the deviance 
be attributed to the particular verb-pairs used in these four cells, 
since the same verb-pairs, when used in other constructions, give 
unambiguous responses. But the fact that the sentences corresponding 
to these "deviant” cells are combinations of negative and affirmative 
clauses makes the following explanation plausible. 

The set of logical reversal verb-pairs would produce the "reversal" 
responses if we assume that the first clause stimulated the ^s to create 
a mental image of a previous action in which the second clause is a 
reply or consequence of that previous action. But when the first clause 
is negative and the second clause affirmative, it is very difficult 
to create a "nonoccurring" previous action such that the affirmative 
second clause is a reply or consequence. Hence, to some S_s, the verb- 
pairs in these sentences had the same effect as if the verbs were sim- 
ilar. In fact, the sentence which corresponded to A-P, John did not 
remember Bill but he was phoned back by him , was unambiguously inter- 
preted in the subject-slot (he * John), which would have been the expected 
response if <remember, phone> did not belong to the class of logical 
reversal verb-pairs. 

Finally we come to the responses for the He Xed him type, which 
appear in Table 5-3. The only difference between the present sentence 
type and the He Xed him back type is the lack of the word "back". But 
without such an explicit word to force the most likely interpretation 
of "to interchange the actors in the two actions", the pronominal ref- 
erents of sentences in the He Xed him type are less determinant. 





Starred cell (*) has p < .01 
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For the nonsense frame where the Ss could only rely on the gram- 
matical form of the two clauses, only four of the 16 sentences 
attained the ,01 level of being unambiguously interpreted. In fact, 
the responses were so non-uniform that even the expected partition of 
the nonsense response matrix into the four corner submatrices was not 
clear. The estimated is smaller than ,50, which implies that when 
the ^s were unable to decide on the referent, the first word (the 
first-mentioned person) was selected. This is clearly the same type 

of bias that was observed in the completely ambiguous sentence type 
Each other . 

In the other extreme, when the verbs were from the class of iden- 
tical verb-pairs, all the 16 sentences were unambiguously interpreted. 
Here the meaning of the verbs provided some real-world situations to 
help resolve the pronominal reference. The fact that the two verbs 
were identical probably also helped the Ss to select the most probable 
interpretation, i.e., as in the He Xed him back type, "to interchange 
the two actors in the two actions". In fact the results showed a 
"diagonal" response in which the cells of the major diagonal were in- 
terpreted in the object— slot while the cells of the minor diagonal 
were interpreted in the subject-slot. The alternative choice would 
result in one of the following four interpretations: 

1) the two clauses of the sentence are exact repetitions of each 
other, e.g., in the A-A construction (or A-A, P-P, P-P) , 

John Xed Bill and he (John) Xed himj 

2) the two clauses are repetitions in a different grammatical form, 
e.g., in the A-P construction (or A-P, P-A, P-A) , 

John Xed Bill and he (Bill) was Xed by himj 
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3) the two clauses are contradictory of each other, e.g., in 
the A-A construction (or A-A, P-P, P-P) , 

John Xed Bill but he (John) did not X him; 

4) the two clauses are contradictory of each other, but expressed 
in different grammatical forms, e.g., in the A~P construction 
(or A-P, P-A, P-A). 

John Xed Bill but he (Bill) was not Xed by him. 

The first two interpretations are acceptable, emphasizing the fact that 
John Xed Bill , but the constructions are very awkward. The latter two 
Interpretations give contradictory statements about "who Xed whom", if 
we assume John and Bill are the only two persons involved. Therefore, 
the lack of a reasonable alternative Interpretation makes the interpre- 
tation "to interchange the two actors in the two actions" the one uni- 
formly adopted. 

For the class of similar verb-pairs, only ten of the 16 sentences 
were interpreted unambiguously. Because the two verbs are different, it 
is possible to interpret the sentences in both ways, i.e., either "the 
same person performed both actions" or "each person performed one action". 
However the latter interpretation is the more likely one, because there 
exists a better construction to express the fact that "the same person 
performed both actions", viz., John Yed and Xed Bill . In this frame, 
there is an overt partition of the response matrix into four corner 
submatrices. Under this partition, the cells of the major diagonal are 
Interpreted in the object-slot (though the data for some of the cells 
did not attain the .01 level) while the cells of the minor diagonal 
are interpreted in the subject-slot (through the data for some did not 
attain the .01 level). Such an observed "diagonal" response substant- 
iates that the interpretation "each person performed one action" was 
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indeed the more plausible one. It is strange, however, that the A-A 
and the A—A sentences turned out to be ambiguous, since these are 
the most simple constructions. The writer cannot offer any logical 
lana^-ions , except sampling error. This effect does not occur with 
other age groups, nor does it occur for the other sentence frames in 
the college sophomore group. 

Finally for the logical reversal verbs, the responses were very 
much like those for the He Xed him back frame. Since this class of 
verbs depended on the creation of a prior action by the "inward" verbs 
in the first clause in which the second clause is a reply to that 
prior action, the word "back" did not add any more information. Again 
when the sentences were constructed from a negative clause (A or ¥) 
followed by an affirmative clause (A or P) , the causal stimulus for the 
creation of a prior action was absent. Hence the responses to these 
four sentences are either random (pronomianal referent remained am- 
biguous) or attained the .01 level (only cell A-P) , but not with the 
"reversal" interpretation. Again, except for these four "deviant" 
cells, the total response matrix can be partitioned into four corner 
submatrices, where the major diagonal is unambiguously interpreted in 
the subject-slot while the minor diagonal is unambiguously interpreted 
in the object-slot, i.e. , the expected "reversal" way. 

-Analysis by t he k-limlted transducer model . Since we are 
dealing with one age group, this model is inteded to account for the 
observed degeneration (i.e., tendency toward random responding) among 
the 16 sentences in a frame. Since all the sentences in the Each 
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other type were responded to randomly, we cannot talk about degenera- 
tion. As for the other two sentence types, since there is a partition 
of every response matrix into four submatrices, each with relatively 
homogeneous numbers, we shall consider the degeneration in the four 
corners. As a rough measure of this degeneration, we shall use the 
number of starred cells in each corner submatrix, i.e,, the number of 
sentences that were *“ '^ponded to unambiguously. Table 5-4 gives the 
number of starred cells in each corner submatrix for the different 
frames in the sentence types He Xed him and He Xed him back . The 

Table 5-4 

Number of starred cells in each corner submatrix 
for the college sophomore subjects 

Nonsense Similar Log, Rev. Identical 
He Xed him 2, 1,0,1 2, 2, 3, 3 3, 2, 3, 3 4, 4, 4, 4 

He Xed him back 4, 4, 2,1 4, 4, 4, 3 3, 2, 3, 3 4, 4, 4, 4 



4-tuple (e,f,g,h) denotes that there are e starred cells in the top- 
left corner (active-active submatrix) , f starred cells in the top- 
right corner (active-passive submatrix) , g starred cells in the 
bottom-left corner (passive-active submatrix) , and h starred cells 
in the bottom-right corner (passive-passive submatrix) . For the logi- 
cal reversal frames, we do not count the "deviant" cells, hence the 
maximum value for each element of the 4-tuple is three. 

We observe a nonincreasing relationship for the elements of the 
4-tuple (e,f,g,h) across all four age groups, i.e., e ^ f > g ^ h. 

This relationship confirms our expectation that the passive transforma- 
tion is the major contributor to the complexity in a sentence. In terms 
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of the k-limited transducer model, the needed k for correct de- 
coding increases from sentences with active-active clauses to sen- 
tences with active-passive clauses to sentences with passive-active 
clauses to sentences with passive-passive clauses. Ihe fact that sen- 
tences with active-passive clauses are generally less ambiguous than 
sentences with passive-active clauses (i.e., f>g), though all of 
the same length, is explained by including the processing as part of 
the k. The passive construction in the first clause requires com- 
plete decoding while the passive construction in the second clause 
needs only to be decoded to obtain the logical function (subject or 
object in the deep structure) of the pronoun he. 

The exceptions to the nonincreasing relationship come from ele- 
ment f in the two logical reversal frames and the similar He Xed him 
frame, and element h from the nonsense He Xed him frame. For the 
logical reversal frames, the cell A-P was not starred, but the "de- 
viant" A-P cell was (both belong to the active-passive submatrix) , 
a fact which is also true for the eighth grade S_s. Thus it appears 
that for the four cells in the active-passive submatrix, the sentence 
for cell A-P was interpreted as ambiguous while the sentence for the 
"deviant" cell A-P was interpreted as unambiguous, but not in the 
"reversal" way. Thus the lower value for element f was partly due to 
our exclusion of the "deviant" cells. As for the He Xed him frame, the 
lower values in elements e and f were due to the peculiar responses 
of the £s for this group, as observed in the previous section. The 
higher value of h « 1 (compared to g « 0) in the nonsense He Xed him 
frame is probably a sampling error caused by the particular confidence 




interval. 
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In comparing the starred cells for the two sentence types, we 
note that both for the identical and logical reversal frames, the 
elements of 4-tuples (e,f ,g,h) are maximal, i.e. , all have the 
number 4 (3 for logical reversal frames). For the nonsense and 

similar frames, however, even the element e (active- active com- 
bination) has not attained the maximal value in the He Xed him sen- 
tence type. This implies that the two sentence types were responded 
to alike (see Table 5-6 and the discussion in section 5.1.c) when the 
verb-pairs were from the class of identical or logical reversal verbs, 
and were responded to quite differently when the verb-pairs were from 
the class of nonsense and similar verbs. . 

The word "back" in the He Xed him back sentence type make the in- 
terpretation "to interchange the two actors in the two actions" so 
forceful that nearly all the sentences in the four frames were unam- 
biguously interpreted. The "program" for such an interpretation is 
quite simple, since the only knowledge needed is the subject-object 
relationship; hence it is reasonable to assume that none of the de- 
coded messages had exceeded the assignment capacity k (second trans- 
ducer) . But in the decoding stage (first transducer) the decoding 
capacity k could be exceeded when sentences contained passive con- 
structions. Thus we see that the values of e are maximal for all 
four frames, reflecting the fact that as long as the sentences were 
decoded correctly, they were processed correctly. However, if a 
sentence contained passive constructions, the _Ss could very easily 
make the mistake of transforming it to its active form without inter- 
changing the two persons, for example, John was Yed by Bill into 
John Yed Bill. This type of error could easily occur because both 
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John and Bill possessed the same lexical features, [Roberts, 1966] or 
that the sentence was reversible [Slobin, 1966]. That is, given any 
verb, real or nonsensical, it was possible to say either John Yed Bill 
or Bill Yed John . In contrast, for example, given the verb "hit" and 
the two nouns "John* and "dog", it is possible to say John hit the dog 
but not The dog hit John . In particular, when the verb was nonsensical, 
there was no semantical criterion to guide the transformation from 
passive to active so that the possibility of a decoding error became 
greater. Indeed the 4-tuple (4, 4, 2,1) for the nonsense He Xed him back 
frame indicated a sharp degeneration as the first clause became a pas- 
sive construction. 

As for the He Xed him frame, the 4- tuples for the identical and 
logical reversal frames attained the maximal values. This indicated 
that sentences of these two frames were correctly decoded (within the 
limits of the k of the first transducer kT) and correctly processed 
(within the limits of the k' of the second transducer k'T). How- 
ever when the verbs were from the classes of similar and nonsense verb- 
pairs, even the most simple constructions (the active-active submatrix) 
did not attain the maximal value, i.e. , e< 4, This suggests that even 
the correctly decoded messages were not processed correctly, i.e., the 
k’ of the second k'T was exceeded due to a more complex "program" for 
assigning the pronomial referent. This "program" must take into account 
the conventional usage as wll as the relationship between the two actions, 
but neither criterion was usable when the verbs were nonsensical. 

Analysis by the information transmission model * The 11 sen- 
tence frames are separately analyzed by the 3-input and 1-output inf or- 
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mation transmission model. The results appear in Table 5-5, where the 
first column H(Z) is the total information or uncertainty in bits in 
the output source Z, The next seven columns, the T— terms, are trans- 
mitted information terms as calculated in percent of H(Z) . The next 
four columns, the A-terms, are the interaction terms, also calculated 
in percent of H(Z) . The last column, x __ » is the chi-square value 

with 16 degrees of freedom for the response difference by the male and 
femal.e ^s in the same age group. A, B and C are input sources, where 
A denotes the grammatical forms of the first clause, B denotes the 
grammatical forms of the second clause and C denotes the sex of the 
£ who is responding. T(ABC) is short for T(ABC;Z) and denotes the 
transmitted information when all three input sources are known. T(AB) 
is short for T(AB;Z) and denotes the transmitted information when 
the A and B sources are known but summed over the dimension C. 

T(AB)/M and T(AB)/F are transmitted information T(AB;Z) calculated 
respectively for only the male and female responsas. T(A) , T(B) and 
T(C) are short for T(A;Z), T(B;Z) and T(C;Z), respectively, and 
they denote the transmitted information when only one of the input 
sources is known. For each T-term, the number of degrees of freedom 
is given immediately below it. The relationship of the interaction 
and the transmitted information terms are shown in the Venn diagram in 
Figure 4-4. 

2 

Let x(n) and x* (n) be two critical values such that P[x > x(n) ] 
< .01 and P[x^ > x* (n) ] < .001 for a chi-square distribution of n 
degrees of freedom. Since the T-terms have a limiting chi-square dis- 
tribution, their x^ values are calculated, but are not shown in Table 



Components of transmitted information for 21 male and 20 female 

college sophomore subjects 
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Checked item (*^) has p < .001 
Starred item (*) has p < .01 



5-5. However, when the value for a T-term exceeds x(n) , that 
T-term has an asterisk (*) next to the term, and if the value ex 

ceeds x' (n) , that T-term has a check ( /) next to the term. The 

^^sex starred or checked according to whether their 

values exceed x(16) or x' (16) respectively. 

The interpretation of these starred and checked T-terms is the 
following. If the input sources, either collectively or singly, do 
not influence the output responses, then the probability of obtaining 
a T-term of the magnitude shown by a starred item is less than ,01, 
and similarly the probability would be less than .001 for a checked 
item. Hence, the null hypothesis that there is no influence between 
the input and output sources is untenable for the starred and checked 
items. Therefore it is safe to assume that the pronominal referent 
is conveyed by the input sources for these T-terms. The interpreta- 
tion of a starred or checked item for x^ (last column) is that 

S 03C 

the responses from the male and female _Ss are different. We will say 
that a term is highly significant when it has a check (/) , i.e., the 
probability of obtaining such a magnitude under the null hypothesis 
for the term is <,001 (p <,001), or that a term is significant when 
it has an asterisk (*), i.e., the probability of obtaining such a 
magnitude under the null hypothesis is < .01 (p < .01). 

The analysis by the information transmission model supplements 
the analysis by the Bernoulli trials model and the k- limited trans- 
ducer model for each sentence frame. These two models analyze the 
uniformity or non-uniformity of responses for each sentence of the 
frame while the infonnation transmission model analyzes the dependency 
between any of the input sources and the output source. This depen- 
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dency is quantified by the T-terms, which is a different measure of 
the uniformity of responses, as an average of the 16 responses in a 
sentence frame. 

Table 5-5 shows that none of the T-terms, except T(C;Z), for the 
three frames of the Each other sentence type has attained the .01 
significance level. This result substantiates the previous finding 
(by the Bernoulli trials model) that every sentence in all three sen- 
tence frames is ambiguously interpreted. That is, both the 16 sen- 
tences individually, and collectively for each frame, were responded to 
in a random manner, and hence they are interpreted as ambiguous. The 
T(CjZ) terms are starred or checked, because there is some overall bias 
in the male and female responses when the responses are summed over the 
A and B sources, i.e., when the individual sentences are not disting- 
uishable. However, when the individual sentences are distinguishable, 
the male and female responses are not significantly different, as shown 
by the for 16 degrees of freedom. 

For the He Xed him and the He Xed him back types, the T-terms are 
all highly significant (p < .001) when either two inputs, A and B, 

are known or all three inputs. A, B and C, are known. In contrast, 
the T-terms when only one source is known are not significant (except 
for T(B;Z) in the nonsense He Xed him back frame, which is probably 
a sampling error) . The interpretation of these highly significant and 
nonsignificant T-terms is the following: when only one input source 

is known, whether it be the grammatical form of the first clause (source 
A) or the grammatical form of the second clause (source B) or the sex 
of the _^s (source C) , it is not possible to predict the pronominal 
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referent (output source Z) ; but if the grammatical forms of both the 
first and second clauses are known (i.e., the structure of the sentence 
is known), with or without knowing the sex of the ^s, we can predict 
much better than chance the pronominal referent of the lie in a sentence. 
The fact that we can predict the pronominal referent when the structure 
of the sentence is known substantiates our previous finding with the 
Bernoulli trials models that most of the individual sentences were unam- 
biguous. Even though only four sentences out of the 16 sentences in 
the nonsense He Xed him frame were unambiguous according to the Bernoulli 
trials model, the T(AB;Z) for the same frame is highly significant. 

This says that, individually, many sentences of the nonsense He Xed him 
frame have ambiguous pronominal referents, but collectively (since the 
information transmission model measures the expected or average values) 
the pronominal referent can be predicted much better than chance. The 
high interaction term, A(ABZ) , means that both sources A and B 
are needed for producing an output response from Z. In fact, T(A;Z) 
and T(B;Z) are nonsignificant, which means the knowledge of the gram- 
matical form of one of the two clauses of a sentence is not sufficient 
to predict the pronominal referent. By comparing the transmitted infom- 
ation T(AB)/M and T(AB)/F, there is no indication that either the 
male or the female ^s could perform better than the other sex. 

The size of the T-terms may be a little misleading, since a T(AB;Z) 
of 8% implies that 92% of the total uncertainty arises from the 
uncontrolled noise component, and yet T(AB:Z) is highly significant. 

In Appendix C, Table C-1, we have computed the T(AB;Z) term for an 
ideal group of n Ss , all but m of whom responded uniformly in a 
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diagonal way, viz, n - m responded with the object-slot in the major 
diagonal and m ^s responded with the object-slot in the minor diagonal. 
T(AB;Z) is a monotonically decreasing function of m. Thus for a total 
of 41 ^s, the 8% for T(AB;Z) corresponds to an m of 15, i.e., 

15 out of 41 ^s, or 37% of the^s, did not agree in their responses. 

Thus the highly significant results come from the fact that for the large 
sample of responses (16 x 41 = 656) , a 63—37 division is sxgnificantly 
different from the expected 5G— 50 divxsxon for random responding. The 
highest T(AB;Z) term is 52% which corresponds to an m of 4, sig- 
nifying that approximately 10% of the ^s did not agree in their responses. 
Since the college sophomore group is the oldest group in the present 
experiment, then we can expect at most 90% agreement among the 41 Ss 
with adult linguistic competence. 

Besides determining the significance (asterisks or checks) of 
the T-temis, we wish to know whether the ^s based their responses on 
the sentences type or on the particular verb— pairs . Thus we want to 
find whether the responses from two classes of verb-pairs in each sen- 
tence type were different, and whether the responses from two sentence 
types for each class of verb-pairs were different. We can not compare 
the magnitudes of the T-terms, since each is calculated as an average 
of the responses to each frame. Hence two T-terms may be close when 
their responses are very different, as between the responses of the 
nonsense He Xed him back and the logical reversal He Xed him back frames. 
The T(AB;Z) terms are respectively 31.85% and 32.54%, yet from the 
response matrices, the major and minor diagonals are reversed with respect 
to each other. 




A better measure is to compare the response matrices , that is as 
in the case of the sex difference, to calculate the chi-square differences 
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between the respective cells of the two matrices. Since the sex differ- 
ence was found to be nonsignificant, we are now comparing the Total ma 
trices. Table 5-6 tabulates the x^(16) values between any two sentence 
types for each class of verb-pairs and Table 5-7 tabulates the X^(16) 
values between any two classes of verb-pairs for each sentence type. Again 
the checked item Indicates that its x^ value exceeds x* (16) and the 
starred item indicates that its x^ value exceeds x(l6) where 
P[X^ > x'(16)] < .001 and P[x^ > *(16)] < -Ol- Table 5-6 reveals that 
college sophomores responded alike to sentence types He Xed hi m and He X^d 
him back when the verbs were Identical or logical reversal. That is, for 
these two classes of verb-pairs, the presence or absence of the word "back" 
made no difference to the S.s. However when the verb-pairs were from the 
classes of the nonsense and the similar verbs, the word "back" made a 
great deal of difference. It should not be surprising that the S,s treated 
the Each other sentence type differently from either the He Xed him type 
or the He Xed him back type, no matter what the verb-pairs were. 

Table 5-7 gives the x^(16) values between any two verb-pairs in 
each sentence type. The column heading HS-SI is for the comparison 
between the nonsense (N3) and the similar (SI) verb-pa- 's; the col- 
umn heading NS-IR is between the nonsense and the log xL reversal 
(LR) verb-pairs ! the column heading NS-ID is between the nonsense 
and the identical (ID) verb-pairs, etc. As expected, there was no 
difference in responding for the Each other: sentence type between any 
two classes of verb-pairs, since all were random responses. For the 
He Xed back sentence type, the ^s responded alike to sentences using 
either nonsense or similar verb-pairs and to sentences using either sim- 
ilar or identical verb-pairs (but not to sentences using either nonsense 



Table 5-6 



Chi-square values between any two sentence types for each 
class of verb-pairs for the college sophomore subjects. 





Xed him 


Xed him 


Xed him back 




vs 


vs 


vs 


Xed him back 


Each other 


Each other 


Nonsense verbs 


94.5/ 


57.3/ 


197.8/ 


Similar verbs 


42.2/ 


157.5/ 


223.2/ 


Log. Rev. verbs 


9.8 


144.0/ 


160.8/ 


Identical verbs 


18,9 






Checked item (/) has 


p < .001 







Table 5-7 



Chi-square values between any two verb-pairs in each 
‘.aentence type for the college sophomore subjects. 









NS-SI 


NS-LR 


NS-ID 


SI-LR 


LI-ID 


LR-ID 


He 


Xed 


him 


83.6/ 


188.3/ 


199.5/ 


319.4/ 


71.4/ 


530,0/ 


He 


Xed 


him back 


22,3 


395.3/ 


41.6/ 


471.2/ 


16,3 


534.8/ 



Each other 3.8 4.3 4.8 



p 



Checked item (v^) has 



.001 



or identical verb-pairs). But this was not true for the He Xed him 



sentence type, since in the absence of the word "back", the verbs must 
carry the "message" about the pronominal referent. Thus each frame in 
the He Xed him type was responded to differently from any of the other 
frames. In addition, the extremely large values for the NS-LR, 

SI-LR and LR-ID pairs indicated that the logical reversal verb— pairs 
were always treated differently from the other verb-pairs in both the 
He Xed him and He Xed him back types, 

d. Summary . The results from the college sophomores group conclu- 
sively demonstrated that subjects could resolve an ambiguity in a sen- 
tence by reducing the meaning of a key word (here the pronoun he) from 
two to one. Although Kaplan [1949] demonstrated that reduction of the 
senses of a key word was possible through its immediate neighboring words, 
the present experiment demonstrated that, for some sentences, any num- 
ber of neighboring words, short of the entire sentence, could not resolve 
the ambiguity. That is, the resolution of the ambiguous referent depended 
on the structure (sentence type and grammatical form) of the sentence. 
However, if a sentence had an inherently ambiguous structure, such as 
that of the Each other type, it was demonstrated that the sentence re- 
mained ambiguous with any verb-pairs and under any grammatical forms. 

It seems safe to generalize this finding to say that some sentences 
would remain ambiguous given all the available contexts in a paragraph 
of chapter. 

The Bernoulli trials model partitioned the 16 sentences in each 
frame into two mutually exclusive sets ; those that remained ambiguous 

and those that had resolved the pronominal ambiguity. With this model 
it was found, essentially, that all the sentences in any frame of the 
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— " ambiguous, while nearly all the sentences in any 

frame of the other two types were unambiguous. Then the k-limited 
transducer model accounted for the degenerative responses by a sequence 
of two k-limited transducers j the decoding transducer (with k units) 
and the assignment transducer (with k* units). The degeneration, then, 
was due either to having exceeded the k units while decoding a sen- 
tence or to having exceeded the k’ units while making the assignment 
for the pronoun. Finally the information transmission model computed 
the dependency between the input sources and the output source in terms 
of the transmitted Information (T-terms) . With the model it was found 
essentially that the dependency was absent for the Each other sentence 
type, but was highly significant when two or three input sources were 
known for both the He. Xed him and the He Xed him back sentence types. 

5.2 The Eighth Grade Group 

The basic data for the eighth grade group appear in Tables 5-8, 

5-9 and 5-10. There were 26 male ^s with an average of 13.5 years 
and an average I.Q. score of 117, and 28 female ^s with an average 
age also of 13.5 years and an average I,Q. score of 116. The data 
are pooled responses from two classes, both from the West Junior High 
School in Ypsllanti, Michigan. Table 5-8 contains the results of 
three sentence frames of the Each other type; Table 5-9 contains the 
results of four sentence frames of the He Xed him back type; and Table 
5-10 contains the results of four sentence frames of the He Xed him type. 
The results are tabulated separately for the male and the female ^s, and 
their pooled data are denoted as "Total". The difference between the 
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male and female responses for each sentence frame Is calculated as a 
chi-square value, which appears in the last column (under in 

Table 5-12. Only the similar He Xed him frame showed a significant 
difference (p < .01) between the male and the female responses. 

As with the college sophomore group, each response matrix in Tables 
5-8, 5-9 and 5-10 represents the 16 possible sentences in a sentence 
frame. The rows of each response matrix denote the grammatical forms of 
the second clause. Each cell of the response matrices corresponds to a 
particular sentence, and its cell number is the number of ^s who selected 
the grammatical object-slot as the referent of the pronoun he. For the 
nonsense frames, there were 25 male Ss instead of 26. 

a. Analysis by the Bernoulli trials model . The maximum likelihood 
estimator* p for each sentence frame (a matrix) appears at the bottom of 
the matrix. Next to the value of p is the confidence Interval [u,v] 
where the probability a is chosen as .01. Hence if we assume a Bernoulli 
trials model, i.e., all the n ^s selected the grammatical object-slot 
as the referent of the pronoun he Independently and with the same proba- 
bility then the probability of obtaining Y object-slots is < .01, 

if Y ^ u or Y >v. Those cells whose numbers are ^ u or > v are 
starred. The sentences correspondingr to these cells are interpreted as 
the unambiguous sentences. Let us see if this interpretation is reason- 
able. 

First, none of the cells of the response metrics in Table 5-8 are 
starred; nence all the sentences belonging to the Each other type are 
interpreted as ambiguous, which is what we had expected. Since the x^(16) 
for the male and female differences are all nonsignificatn, we shall dis- 
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CUSS only the pooled data. Since the estimated values of p vary from 
.43 to .46 (as compared to a variation from .26 to .28 for the 
college sophomore group), there is no marked bias toward the subject- 
slot. For this group there was no uniform responding when the ^s re- 
cognized the impossibility of deciding the selection of the pronominal 
referent. There is no question that the ^s had recognized the ambiguity 
in these sentences, since their responses to the other two sentence types 
were polarized, i.e., subject-slot responding (low numbers in the matrices) 
for one set of sentences and object-slot responding (high numbers in the 
matrices) for the remaining set of sentences. 

As for the responses to the sentence type He Xed him back , which 
appear in Table 5-9, none of the four response matrices was completely 
starred. However, the responses were definitely non- random, especially 
when the verb-pairs were identical. In contrast to the Each other type, 
the word "back" in the He Xed him back type provided the cues for a most 
probable interpretation, i.e., "to interchange the actors of the two 
actions". Let us see if the responses were consistent with this expected 
interpretation. Again since the male and the female responses were not 
significantly different, only their pooled data are considered. 

For the identical frame, the response matrix may be partitioned into 
four corner submatrices, each with relatively homogeneous numbers. Thus 
the responses were invariant under the negative transformation. In add- 
ition, we observe that the responses were "diagonal", that is, the cells 
q£ major diagonal (the four top— left cells and the four bottom— right 
cells) have one response while the cells of the minor diagonal (the four 
top-right cells and the four bottom-left cells) have the other response. 
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This observed "diagonal" response suggests that the ^s selected the 
pronominal referent on the basis of the deep structure rather than the 
surface structure of the sentences. Further, the fact that the cells 
of the major diagonal were largely responded unambiguously in the ob- 
ject-slot and that all the cells of the minor diagonal were responded 
unambiguously in the subject-slot was consistent with the intended 
interpretation for the word "back". 

There are not as many starred cells for the similar frame as for 
the identical frame. However, there is still a clear partition of the 
response matrix into four comer submatrices. Thus the responses were 
invariant under the negative transformation. All the starred cells 
form part of a "diagonal" response, which is consistent with the assump- 
tion that ^s responded to the deep structure rather than the surface 
structure of the sentences. Further the starred cells in the major di- 
agonal were object-slot responses (cell number ^ v) while the starred 
cells in the minor diagonal were subject->slot responses (cell number 4 u) . 
Thus the interpretation used for the similar frame was also "to inter- 
change the two actors of the two actions", which is consistent with the 
word "back". 

For the nonsense frame, however, it is meaningless to partition the 
matrix since most of the sentences elicited random responding. Of the 
four starred cells three came from the most simple constructions, i.e. , 
when both clauses of the sentence were active (A or A) , while the re- 
maining one came from the A-P construction. The next section will 
describe more about this degenerative behavior. 

As for the logical reversal frame, we again observe the same four 
"deviant" cells that occurred for the college sophomore group. These are 
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til© cslls conTSsponding to ssntsncss wlios© fi.irst cl©us6 is nsgstiv© (A or 
P) ©nd whose second clause is affirmative (A or P) • In fact) two of the 
four cells (X-A and A-P) attained the .01 significance level not in 
the expected "reversal" way. That is, the negative-affirmative constructions 
with the logical reversal verb-pairs made the creation of a "non-occurring" 
previous action so difficult that the Ss effectively treated the verbs as 
if they belonged to the class of similar verbs. Besides the four "deviant" 
cells, the remaining ones can be partitioned into corner submatrices. Fur- 
ther, the expected "reversal" response is observed, since the major diagonal 
has the subject-slot responding while the minor diagonal has the object-slot 

responding. 

For the sentence type He Xed him , we have the response matrices shown 
in Table 5-3.0, Basically the results are not very different from those 
shown in Table 5-9 for the He Xed him back type. The responses, however, 
are degenerated, from nearly all unambiguous responding, for the identical 
frame, to nearly random responding for the nonsense frame. Only the sim- 
ilar frame showed a significant difference between the male and the female 
responses. However, for ease of discussion, we shall still pool the male 
and the female responses, and consider the "Total" matrices. 

The results from the identical frame can be partitioned into four 
comer submatrices. In addition, the major diagonal has the object-slot 
responding while the minor diagonal has the subject-slot responding. Hence, 
even without the word "back", the identical frame was responded to with the 

interpretation "to interchange the two actors of the actions". 

At the other extreme, the results from the nonsense frame are nearly 

all random. Here the Ss lacked both the semantic cues from the verbs and 



Table 5-10 

Response matrices for 26 male^and 28 female eighth grade subjects. 
Sentence type: John Yed Bill and he Xed him. 
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the structural cues from this sentence type (viz., the word "back") to 
assist them in giving a uniform response. The next section will attempt 
to characterize the degeneration by a k— limited transducer. 

As for the similar frame, the partition into submatrices is still 
observable as well as a difference in response between the major and the 
minor diagonals. Again, the most plausible interpretation used by the 
^s was "to interchange the two actors of the two actions", although the 
structural cues from the word "back" were absent. However, responses to 
the more syntactically complex sentences are random. 

As for the logical reversal frame, the four "deviant" cells are still 
observable as are the four comer submatrices. However, since the more 
cally complex constructions tend to degenerate in the responses 
they elicit, all the cell numbers in the four bottom-right corner submatrix 
(sentences with passive-passive construction) are close to each other, so 
that it is hard to classify one of them (P-P) as "deviant" from the other 
three. Besides these "deviant" ones, the starred cells in the major and 
minor diagonals confirm the expected "reversal" responding. 

^ • Analysis b y the k— limited transducer model . Since we are dealing 
with one age group, this model is intended to account for the observed 
degeneration among the 16 sentences in each frame. Since all the sen- 
tences in the Each other type were responded to randomly, we cannot talk 
about degeneration. In Table 5-11 we tabulate the 4-tuples (e,f,g,h) as 
measures of degeneration for each frame from the He Xed him back and He 
types. The elements in the 4-tuples are respectively the numbers 
of starred cells of submatrices in the top-left comer, the top-right 
corner, the bottom- left corner and the bottom-right comer. 
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The observed fatt that the elements of the A-tuples are nonin- 
creasing from active to passive confirms our expectation that passive 
transformation is the major contributor to the complexity in a sentence. 

Table 5-11 

Number of starred cells in each corner submatrix 
for the eighth grade subjects 

Nonsense Similar Log. Rev . Identical 

He Xed him 1,0, 0,0 3, 3, 2,0 3, 2, 2,1 A, 4, A, 2 

He Xed him back 3, 1,0,0 A, 3, 3,0 3, 1,2,1 A,A,A,1 

In terms of k-limited transducers, the needed k for correct decoding 
increases from sentences with active-active clauses to sentences with 
active-passive clauses to sentences with passive- active clauses to, 
finally, sentences with passive-passive clauses, all without regard to 

the negative transformations. 

For the Hp Xed him back type, the decoded sentences plus the assign- 
ment "programs" are within the limits of the k' of the assignment trans- 
ducer for all frames except the nonsense frame. This Is the case because 
the first elements of the 4-tuples, which correspond to sentences with 
active-active constructions, are maximal for these three frames (note that 
3 Is the maximal value for the logical reversal frame) . The Identical 
frame also has maximal values for the second and third elements of Its 
4-tuple, while the similar and logical reversal frames do not, which sug- 
gests that the decoding was facilitated when the two verbs were Identical. 
This seems reasonable, since both clauses use the same verb to describe 
an action between two persons so that If a S had decoded one clause 




90 



correctly, then the decoding of the other clause could have been greatly 
facilitated. However, this interpretation would suggest that the responses 
to the four bottom-right cells (the passive-passive construction) should 
be more uniform than the four bottom-left cells (the passive-active con- 
structions) , since we would expect less interference in transferring the 
decoded message from one passive clause to another passive clause than in 
transferring from a passive clause to an active clause. In terms of de- 
generation and the 4-tuples (e,f,g,h), we would then expect g< h for the 
identical frame. But we observe the contrary. The contradiction is pro- 
bably due to our arbitrariness in selecting the critical point of .01 
(i.e., a * .01) in the Bernoulli trials model, but more importantly to 
the maximal likelihood estimator The value of ^ thus computed was 

greater than .50 ‘actually .55), which is biased toward responding to 
the object-slots. Thus for the same critical level of a , it requires 
more uniform responding, as measured by percent of agreement among the ^s, 
to attain a significant result (i.e. , being starred) when the response is 
for the object-slot than is for the subject-slot. Thus the Bernoulli trials 
model, in this case, tends to give a smaller value for h. If we take the 
average percent agreement among the in the four comer submatrices, we 
would obtain the following 4-tuple: (99, 71, 67, 70). Thus, as far as 

the Ss' agreement in their responses is concerned, there was actually more 
agreement in the passive-passive constructions (fourth element) than the 
passive- active constructions (third element), although their numerical 
difference in this case is small. Hence, the result is consistent with 
the hypothesis that identical verb-pairs tended to facilitate the decoding 
of a second clause after having decoded correctly the first clause, but 
that the facilitation may not be enoug)i to overcome the bias against 
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2 

attaining a predetermined tail probability level* Finally for the nonsense 
frame of the He Xed him back type, even the value of e is not maximal. 

Thus for the most simply constructed sentences (the active-active construct- 
tions) the eighth grade group could not make the pronominal assignment as 
easily for the nonsense verb-pairs as for the real verb-pairs. In terms 
of the k-limited transducer, we say that the "assignment program" for 
sentences with nonsense verb-pairs was sufficiently longer than for sen- 
tences with real verb-pairs so that the limits of the memory span k* was 
exceeded for some ^s. If, in addition, we introduce syntactic complexity 
in sentences , we observe a very quick degeneration into complete random 
responding, as witnessed by the 4-tuple (3, 1,0,0). 

For the He Xed him type, both the identical and the logical reversal 
frames have attained the maximal value for at least the first element of 
the 4-tuple (e,f,g,h), which implies that sentences with either identical 
or logical reversal verb-pairs were well within the limits of the k* of 
the assignment transducer. For these sentences, the ^s seem to do as well 
whether the word "back" was present or not. For the identical frame, both 
the second and the third elements of the 4-tuple are also maximal, but 
g > h. Again if we calculate the percent agreement for the four comer 
submatrices, we obtain the following 4-tuple: (91, 68, 65, 75). Thus 

there was actually a higher degree of agreement among the Ss* responses 
under passive-passive constructions than under the passive-active con- 

^The data from the identical He Xed him frame in this group and from 
both frames in the other age groups are more convincing in supporting 
this hypothesis. For example, the 4-tuple in percent agreement for 
the identical He Xed him frame in the eight grade is (91, 68, 65, 75); 
the 4-tuples in percent agreement for the identical He Xed him back 
and He Xed him frames for the college sophomore group are (94,90, 79, 86) 
and (100, 82, 75, 93) respectively; the 4-tuples in percent agreement 
for the identical He Xed him back and He Xed him frames for the seventh 
grade group are (87, 69, 56, 67) and (95, 81, 60, 75) respectively. 
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structions. For the nonsense frames, however, the assignment was so 
difficult that even the correctly decoded message exceeded the limits of 
the k' of the assignment transducer. Likewise the non— maximal value 
for e in the similar frame suggests that the limits of the k is 
exceeded by some ^s, and as sentences become more complex syntactically, 
the responses became random, as witnessed by h — 0. 

c. Analysis by the information transmission model . The results 
from an analysis by the information transmission model appear in Table 5-12. 
The columns are components of transmitted information (the T-terms) and 
interaction (the A-terms) terms. Again any checked (/) term is highly 
significant, i.e., under the null hypothesis of no transmission between 
the input sources and the output source, the probability of obtaining the 
term of such a magnitude is < .001. Similarly, any starred (*) term is 
significant, i.e., the probability of obtaining such a magnitude under 
the null hypothesis is < .01. 

The T-terms for the three frames in the Each other type are all non- 
significant. Thus the output response was independent from the input 
sources, i.e., any knowledge of the grammatical form of the first clause 
(input source A) , the grammatical form of the second clause (input source B) . 
and/or the sex of the S. (input source C) did not influence the prediction 
of the pronominal referent (output source Z) . This is consistent with the 
interpretation from the Bernoulli trials model, that each of the 16 sen- 
tences of the Each other frames elicited random responses. 

For the four frames in the He Xed him back type, all the doubly- 
controlled (knowing input sources A and B) and triply-controlled 
(knowing all three input sources) T-terms are highly significant, except 
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one, which is significant. Hence even though only four sentences are 
individually unambiguous (by the Bernoulli trials) in the nonsense 
frame, the knowledge of the grammatical forms of the two clauses in a 
sentence was sufficient to predict the response. That is, the 16 sen- 
tences of the nonsense frame, on the average, were not ambiguous. Many 
of the T(A) and T(B) terms are also significant or highly significant, 
which is caused by differences in the resulting responses when summed 
over the sources A and C and in the resulting responses when summed 
over the sources B and C. Hence there is some tendance for this 
younger group to "jump" to a response based on the grammatical form of 
the two clauses. The maximum T(AB) term comes from the identical frame. 
Its value is 21.5%, which corresponds to an m of 13 in Table C-1. 

That is, we can expect a maximum of (54 - 13) /54 “ 76% agreement among 
the 54 ^s in their response to the He Xed him back type. 

As for the He Xed him type, all the doubly-controlled and triply- 
controlled T-terms, except for the nonsense frame, are highly significant. 
Thus, on the average, sentences with real verb-pairs of this type are con- 
sidered unambiguous. In other words, there were enough cues "transmitted" 
from the sentences to the ^s to produce uniform responding. On the other 
hand, when nonsense verb-pairs were used (without the word "back"), the 
doubly- controlled T-terms are not significant, i.e., there were not enough 
cues "transmitted" for uniform responding. Some of the T(A) and T(C) 
terms are checked or starred which is caused by some bias in the resulting 
responses when summed over the other two sources. For this sentence type, 
the maximum T(AB) is 23% which corresponds to an m value of 12 
in Table C-1. Hence we can expect at most a (54 - 12) /54 « 78% agree- 
ment among the 54 eighth grade Ss in the sentence type He Xed him. 
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The maximal agreement for both the He Xed him and the He Xed him 
back types occurred when the verb-pairs were identical* The fact that 
the identical He Xed him has a higher agreement than the identical He 
Xed him back suggests some interference between the cues from the word 
"back" and the cues from "identical verbs". The T(AB)/M terms are not 
consistently greater (or less) than the corresponding T(AB)/F terms; 
hence there is no indication that either the male or the female ^s 
could out-perform the other. 

One question we wish to ask is whether the ^s based their responses 
on the sentence type or on the particular verb-pairs. Hence we want to 
find out whether there is any difference between the responses from two 
classes of verb-pairs for the same sentence type and whether there is any 
difference between the responses from two sentence types with the same 
class of verb-pairs. In Table 5-13, we have tabulated the values 

between any two sentence types for each class of verb— pairs. In contrast 
with the college sophomore group, the eighth grade ^s did not make any sig- 
nificant different responses between the He Xed him and the He Xed him back 
types. Of course the Each other type was responded to differently from 
the other two types. However, the nonsense He Xed him frame was not sig- 
nificantly different from the nonsense Each other frame; hence the non- 
sense He Xed him frame must also be random, a fact consistent with the 
finding from the Bernoulli trials model. 

Table 5-14 tabulates the x^(16) values between any two verb-pairs 
in each sentence type. We note again that the _Ss responded to the Each 
other sentence type in a way that was independent of the particular class 
of verb-pairs, since all the sentences elicited random responding. For 
the He Xed him back sentence type, the £s responded alike to sentences 
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Table 5-13 



Chi-square values between any two sentence types for each 
class of verb-pairs for the eighth grade subjects. 





Xed him 
vs 

Xed him back 


Xed him 
vs 

Each other 


Xed him back 
vs 

Each other 


Nonsense verbs 


22.7 


18.6 


46.9/ 


Similar verbs 


29.6 


44 c 6/ 


71.7/ 


Log. Rev. verbs 


6.3 


69.1/ 


54.3/ 


Identical verbs 


9.2 







Checked item (V) has p < .001 



Table 5-14 



Chi-square values between any two verb-pairs in each 
sentence type for the eighth grade subjects. 





NS-SI 


NS-LR 


NS-ID 


SI-LR 


SI-ID 


LR-ID 


He Xed him 


44c 3 / 


91.7/ 


143.8/ 


159.1/ 


55.4/ 


310.7/ 


He Xed him back 


30,6 


118.6/ 


64,7v 


199.9/ 


20.2 


251.9/ 


Each other 


7.9 


5,. 8 




2.8 






Checked Item (V) 


has p 


.001 
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using either nonsense or similar verb-pairs and to sentences using either 
similar or identical verb— pairs (but not to sentences using either non- 
sense or identical verb-pairs) , This was not true for the He Xed him 
sentence type, since in the absence of the cue word "back", the verbs 
must carry the "message" about the pronominal referent. In addition, the 
very large values for the NS-LR, SI-LR and LR-ID differences in 

the He Xed him and the He Xed him back types indicate that the logical 
reversal verb-pairs were treated very differently from the other verb-pairs. 

d. Summary . The results from the eighth grade ^s demonstrated that 
they could largely resolve the pronominal referent when real verbs were 
used. With the Bernoulli trials model, it was found that all sentences 
ill the Each other type were ambiguous. For the other two sentence types, 
however, it was found that nearly all the sentences in the identical frames 
were unambiguous while nearly all the sentences in the nonsense frames 
were ambiguous. The set of unstarred cells (the sentences corresponding 
to these cells are interpreted as ambiguous) for the frames with real 
verbs contained sentences that were more complex syntactically. Hence the 
k-limited transducer model used this fact to explain the degeneration in 
the ^s responses, i.e., the more syntactically complex sentences tended 
to exceed the memory span in the decoding stage of the two-stage model. 

With the nonsense He Xed him frame, the unstarred set contained 15 of 
the 16 cells; hence the k-limited transducer model explained the degen- 
eration in which the memory span in the assignment stage was exceeded. 

} the information transmission model computed the T— terms , which 
measured the dependency or predictability between the input sources and 
the response. The signigicant and the highly significant T-terms provided 
those combinations of the input sources which produced uniform responding. 



In addition, since the T- terms were calculated as an average value over 
the 16 sentences of a frame, these served to summarize the £s’ responses 
for each sentence frame instead of individual sentences (of which there 
are 176) . 

5.3 The Seventh Grade Group 

The basic data for the seventh grade group appear in Tables 5-15, 

5-16 and 5-17. There were 13 male Ss with an average age of 12.5 years 
and an average I.Q. score of 115 and 13 female £s with an average age 
also of 12.5 years and an average I.Q. score of 116. They were students 
at the West Junior High School in Ypsilanti, Michigan. Table 5-15 contains 
the results of three sentence frames of the Each other type; Table 5-16 con- 
tains the results of four sentence frames of the He Xed him back type; and 
Table 5-17 contains the results of four sentence frames of the He Xed him 
type. The results are tabulated separately for the male and female ^s and 
their pooled data are denoted as "Total". The difference between the male 
and female responses for each sentence frame is calculated as a chi-square 
value, which appears in the last column (under x Table 5-19. 

Since there was no significant difference between the male and the female 
responses for any of the sentence frames, subsequent discussion will per- 
tain to the pooled data. 

Each response matrix in Tables 5-15, 5-16 and 5-17 represents the 16 
possible sentences in a sentence frame. The rows in each response matrix 
denote the grammatical forms of the first clause, while the columns denote 
the grammaticl forms of the second clause. Each cell of the response 
matrices corresponds to a particular sentence, and the number in the cell 
is the number of _^s who selected the grammatical object— slot as the refer- 
ent of the pronoun he. For the nonsense frames, there were 12 female S_s 



instead of 13. 
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j^alysls by the Bernoulli trials model . The tsaximum likelihood 
estimator p for each sentence frame (a matrix) appears at the bottom of 
the matrix. Next to the value of p is the confidence interval [u,v] 
where the tail probability a is chosen as .01. Hence, if we assume a 
Bernoulli trial, i.e., that all the n ^_s selected the grammatical object- 
slot as the referent of the pronoun he independently and with the same 
probability p, then the probability of obtaining Y object-slots is 
<.01, if Y^u or Y>v. Those cells whose numbers are ^ u or ^ v 
are starred. The sentences corresponding to these cells are interpreted as 
unambiguous sentences. 

First, none of the cells in the response matrices in Table 5-15 are 
starred (except one, which is probably due to sampling error) ; hence all 
the sentences belonging to the Each other type are interpreted as ambiguous, 
again as we would expect. It is interesting to note that the estimated 
values of p (.56 to .60) are above .50, which shows a bias toward 
responding in the object-slot. The older groups (including the ninth 
grade group) were all biased toward responding in the subject— slot , and 
especially the college sophomore group, whose estimated values of p 
ranged from .26 to .28. It could be that when the structure of a sen- 
tence does not provide any cue for selecting the pronominal referent, 
adults tend to select the first mentioned person (i.e., a type of primacy 
effect) whereas younger children tend to select the person mentioned 
closest to the pronoun (i.e., a type of recency effect). However, the 
evidence presented so far is not convincing, even though the fifth grade’s 
estimated p is in the range of .53 to .57. Perhaps this "recency 
effect" will appear more strongly when we examine the responses of much 
younger children, whose memory spans are exceeded by the sentence length. 
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Table 5-15 

Response mstrlces for 13 male and 13 female Wnth grade sobjects 
Sentence type: John and Bill Yed each other and be 
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As for the responses from the sentence type He Xed him back , which 
appear in Table 5~16, a majority of the sentences were responded to ran- 
domly. We can discern a partition of the response matrix into corner sub- 
matrices only in the identical frame. Among the starred cells in the 
identical frame, the responses were clearly "diagonal", and, in addition, 
were consistent with the intended interpretation for the word "back". 
Without the added cues from pairs of identical verbs, however, this group 
of younger £s was not able to assign the pronominal referent uniformly, 
except for the most simple active-active constructions. As for the logical 
reversal frame, we observe the "reversal" response, since the cells in the 
four comer submatrices have large numbers (e.g., greater than n^) where 
the corresponding cells in the identical frame have low numbers (e.g., less 
than n^) , and vice versa. Of the 16 cells, only four of them attained 
the .01 level. Such a "reversal" response supports the hypothesis that 
the seventh grade children responded to the cues from the class of logical 
reversal verb— pairs, i.e., cues creating a previous action with a change 
of subject-object relationship. For the older ^s, a negative clause 
(either A or ?) followed by an affirmative clause (either A or P) 
tended to elicit a non-reversal responding. However, since the responses 
are largely random for the logical reversal frame (for a relatively small 
sample of 26 ^s) , it is not clear whether the negative-affirmative con- 

structions would also elicit non-reversal responding, especially in view 
of the fact that one of the four cells (cell P-P) actually elicited a 
reversal response. We need to test again for this effect with a larger 
sample. 

Finally we have the response matrices from the sentence type He Xed 
him, which appear in Table 5-17. Again a majority of the sentences were 
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responded to ambiguously. However, the identical frame produced a "dia- 
gonal" response which is consistent with responding to the deep structure 
of the sentences, and, in addition, follow the expected interpretation of 
"to interchange the two actors of the two actions". For the logical re- 
versal frame, the responses are clearly "reversal", although there are 
only four starred cells. The remaining cells have small numbers when the 
corresponding cells in the identical frame have large numbers and vice 
versa. It is not clear how the ^s responded to the negative-affirmative 
constructions, since all these four deviant cells received random responding. 
For the nonsense frame, no cells are starred, indicating that the ^s were 
not able to gather enough cues from sentences with nonsense verb-pairs to 
properly assign the referent, not even from the most simple A-A sentence. 

b. Analysis bv the k-limited transducer model . In this section, we 
shall account for the observed degenerative behavior among the 16 sen- 
tences in each frame by the k-limited transducer model. Table 5-18 tabu- 
lates the 4-tuples (e,f,g,h) for each sentence frame in the He , Xed . J4 m 
and the He Xed him back types. 

We observe that only the identical frames produced maximal values 
in the 4- tuples, which means that the limits of the assignment transducer 
could not have been exceeded. However, as the sentences became more 
complex syntactically, the limits of the decoding transducer were exceeded, 
hence the drop from a maximum of four to two for element g and zero for 
element h of the 4-tuple (e,f,g,h). The 4-tuples for the similar and 
* the logical reversal frames in both sentence types may be explained by a 
pair of k-limited transducers where the limits of the second (assignment) 
transducer were exceeded by some _Ss, hence the non- maximal values for the 
first element of the 4-tuples; and where the limits of the first (decoding) 
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transducer were exceeded by passive constructions » hence the value zero 
or two for the other three elements of the 4-tuples. Finally, for the 
nonsense He Xed him frame, none of its 16 sentences was responded to 
uniformly, that is, there were not enough cues to produce a correct 

Table 5-18 

Number of starred cells in each corner submatrix 
for the seventh grade subjects 

Nonsense Similar Log. Rev. Identical — 

He Xed him 0,0, 0,0 1,2, 0,0 3, 1,0,0 4, 4, 2,0 

He Xed him back 3, 0,0,1 2, 0,0,0 2, 0,1,1 4, 4, 2,0 

assignment. The all-random responses may be accounted for by a pair of 
k-limited transducers where the limits of the second (assignment) trans- 
ducer were exceeded by all the ^s. For the nonsense He Xed him back 
frame, the cues from the word "back" made the assignment much easier, 
at least for the simple active-active constructions. But when sentences 
contained passive grammatical forms the assignment became random, probably 
due to incorrect decoding of the sentences into their respective deep 
structures. Hence we can explain the degeneration of this nonsense 
He Xed him back frame with the k-limited transducer model where the limits 
of the second transducer were not exceeded, but where the limits of the 
first (decoding) transducer were exceeded whenever there was a passive 
transformation. 

c. Analysis bv the information transmission model . The results of 
the analysis by information transmission model appear in Table 5-19. The 
columns are components of transmitted information (the T-terms) and inter- 
action (the A-terms) terms. Again any checked (/) term is highly sign- 
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ificant, i.e., under the null hypothesis of no transmission between the 
input sources and the output source, the probability of obtaining the term 
of such a magnitude is < .001. Similarly, any starred (*) term is sig- 
iiificant when the probability of obtaining such a magnitude under the null 
hypothesis is < .01. 

The T-terms for the three frames in the Each other typ>^ are all non- 
significant. Thus the output response was independent from the input 
sources, i.e., any knowledge of the gramatical form of the first clause 
(input source A), the grammatical form of the second clause (input source 
B) and/or the sex of the ^s (input source C) did not influence the predic- 
tion of the pronominal referent (output source Z) . This is consistent with 
the finding from the Bernoulli trials model that each of the 16 sentences 
in any of the Each other frames was a random response. 

for the four frames of the He Xed him back type, the T(AB.) terms 
are all highly significant, while some of the other doubly-controlled and 
triply-controlled T-terms are either significant or nonsignificant. Thus 
the group as a whole was able to use the cues from the word "back" for 
uniform responding for any class of verb-pairs. However, when the male 
and female responses are analyzed separately, although the T(AB) /M or 
T(AB)/F terms are of the same magnitude as T(AB) , some of these are not 
significant. Since the value for the terms is directly proportional 

to the sample size n (x^ = 1.3863 n T) , a reduction in n gives a 
corresponding reduction in the x^ value. Some of the T(A) and T(B) 
terms are also significant or highly significant, which suggest some bias 
in the resulting responses when summed over the other two input sources. 

The maximum T(AB) term comes from the identical frame. Its value of 
18% corresponds to an m of 7 in Table C-1. Thus we can expect at 

most a (26-7) /26 = 73% agreement among the 26 seventh grade £s in 

o 

ERIC 
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responding to the He Xed him back type. 

As for the He Xed him type, only the identical and the logical rever- 
sal frames hhl^ highly significant T-terms. Hence in the absence of the 
cues from the word "back", a knowledge of the grammatical forms of the two 
clauses in the nonsense frame could not predict the response, and the pre- 
diction is significant (though not highly significant) for the similar 
frame. The maximum T(AB) term comes from the identical frame. Its value 
of 37% corresponds to an m of 5 in Table C-1. Thus we can expect 
at most a (26-5) /26 *= 81% agreement among the 26 seventh grade ^s in 
responding to the He Xed him type. 

Again we observe a larger maximum agreement when the word "back" was 
absent in sentences with identical verb-pairs. Thus the double cues from 
"back" and "identical verbs" produced less uniform response than just 
"identical verbs". On the other hand, the cues from "back" alone produced 
much more uniform responding with either the nonsense verbs or the sim- 
ilar verbs than without the word "back* . The maximum of 81% agreement 
for the identical He Xed him frame is actually higher than the corresponding 
one among the eighth grade ^s, which is 78%. The relatively small dif- 
ference between 81% and 78% produced a larger difference in the max- 
imum T(AB;Z) of 32% and 23% may nevertheless be a sampling error. 

Again, we wish to know whether the ^s based their responses on the 
sentence type or on the particular verb-pairs. Hence we want to find out 
whether there is any difference between responses from two classes of verb- 
pairs for the same sentence type and whether there is any difference between 
the responses from any two sentence types with the same class of verb- 
pairs. In Table 5-20, we have tabulated the x^(16) values between any 
two sentence types for each class of verb-pairs. As with the eighth grade 
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Table 5-20 



Chi-square values between any two sentence types for each 
class of verb-pairs for the seventh grade subjects. 





Xed him 
vs 

Xed him back 


Xed him 
vs 

Each other 


Xed him back 
vs 

Each other 


Nonsense verbs 


9.0 


9.7 


22.1 


Similar verbs 


4.3 


16.4 


32.6* 


Log. Rev. verbs 


9.0 


43.0/ 


40.4/ 


Identical verbs 


13.3 







Checked item (/) has p < .001 
Starred item (*) has p < .01 



Table 5-21 

Chi-square values between any two verb-pairs in each 
sentence type for the seventh grade subjects. 





NS-SI 


NS-LR 


NS-ID 


SI-LR 


SI-ID 


LR-ID 


He Xed him 


6.7 


55.6/ 


45 .6/ 


68.6/ 


40.6/ 


164.8/ 


He Xed him back 


7.6 


70.3/ 


23.3 


80.4/ 


21.3 


113.8/ 


Each other 


6.8 


8.8 




13.0 







Checked item (/) has p < .001 
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^s, there was no significant difference between the He Xed him and the He 
Xed him back types. However, in contrast with the two older groups, there 
is no significant difference among the three sentence types with nonsense 
verb-pairs, as well as with the similar frames of the He Xed him and the 
Each other types. Since the responses to the Each other type were inter- 
preted as random, the responses from the nonsense He Xed him , the nonsense 
He Xed him back ,, and the similar He Xed him frames were also essentially 

random, as these were not significantly different from the Each other 
type. 

Table 5-21 tabulates the x^(16) values between any two verb— pairs 
in each sentence type. Again we note that the Ss responded to the Each 

sentence type in a way that was independent of the particular class 
of verb-pairs, since all the sentences elicited random responding. Large 
significant differences still come from NS-LR, SI-LR and LR-ID pairs, 
which confirm our finding from the Bernoulli trials model that the response 
matrices using the logical reversal verb-pairs in the He Xed him and He 
Xed him back types included the "reversal" response, although both had 
only four starred cells. The only other highly significant terms are the 
NS— ID and SI— ID pairs in the He Xed him type. Thus in the absence of 
the word back", the Identical frame produced a unique response which is 
different from the nonsense and similar (and, or course, also the logical 
reversal) frames. However, when the word 'back" was present, all the 
nonsense, simxlar and identical frames in the He Xed him back type were 
responded to alike. 

d. Summary . The results from the seventh grade ^s showed a further 
degeneration with respect to the responses of the eighth grade Ss, These 
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may be attributed to shorter memory spans, both in the decoding and in 
the assignment stages of the k-limited transducer model. With the 
Bernoulli trials model, it was found that all the cells in the Each 
other type were not starred, and the sentences corresponding to these 
were interpreted as ambiguous. For the other two sentence types, only 
the identical frames have more than half of the cells starred, and the 
sentences corresponding to these starred cells were interpreted as unam- 
biguous . Although only four cells in the logical reversal frames were 
starred, the direction in which these cells attained the ,01 signifi- 
cance level, i,e., whether the cell number was $ u or > v, indicated 
that the response matrix included "reversal" responses. The k-limited 
transducer model also used the assumed shorter memory span of seventh 
grade ^s to explain the degeneration of response across the 16 sentences 
of a frame. Finally with the information transmission model, it was 
found that the T(AB) terms for all four frames in the He Xed him back 
type were highly significant while only the identical the the logical 
reversal frames in the He Xed him frame were highly significant. That 
is to say, we can predict the output response when the grammatical form 
of the two clauses are known in these six frames. In addition the max- 
imum percentage agreement for the identical He Xed him frame turned out 
to be higher for seventh than for eighth grade ^s. 

5.4 The Fifth Grade Group 

The basic data for the fifth grade group appear in Table 5-22, 5-23 
and 5-24. There were 14 male ^s with an average age of 10.7 years 
and an average I.Q. score of 98, and 9 female ^s with an average age 



o 

ERIC 



112 



of 10.8 years and an average I.Q. score of 105. There were two £s, 
one male ^ and one female who were not able to read. The teacher read 
the test material, one sentence at a time, to the male non-reader and 
helped him mark an "x" on top of the word he orally selected, while the 
writer did the same with the female non-reader. When their responses were 
tabulated, there were no systematic differences with the other ^s in this 
age group, and their responses were pooled along with the rest of the fifth 
graders. The ^s were students at the Adams Elementary School in Ypsilanti, 
Michigan. 

Each response matrix in Tables 5-22, 5-23 and 5-24 represents the 16 
possible sentences in a sentence frame. The rows in each matrix denote 
the grammatical forms of the first clause, and the columns denote the gram- 
matical forms of the second clause. Each cell of the response matrices 
corresponds to a particular sentence, and the number in the cell is the 
number of ^s who selected the gramatical object-slot as the referent of 
the pronoun he . The male and the female responses are tabulated separiately 
with their pooled responses as "Total". Since the male and the female 
responses are not significantly different, subsequent discussion will 
pertain to only the "Total" matrices. 

a. Analysis by the Bernoulli trials model . The maximum likelihood 
estimator p for each sentence frame (a matrix) appears at the bottom of 
the matrix. Next to the value of p is the confidence interval [u,v] 
where the tail probability a is chosen as .01. Hence, if we assume a 
Bernoulli trials model, i.e., all the n ^s selected the grammatical 
object-slot as the referent of the pronoun independently and with the 
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Response matrices for 14 male and 9 female fifth grade subjects 
Sentence type: John Yed Bill and he Xed him* 
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same probability p, then the probability of obtaining Y object- 
slots is < .01, if Y ^ u or Y > v. Those cells whose numbers are 
< u or > V are starred. The sentences corresponding to these cells 
are interpreted as unambiguous sentences. 

Of the 176 cells in the 11 response matrices, only four cells 
are starred, i.e., there were four unambiguously interpreted sentences by 
the fifth grade subjects. As expected, all the sentences from the Each 
other sentence type were ambiguous. As for the He Xed him back type, only 
two sentences from the identical frame were unambiguous, while for the He 
Xed him type, only one sentence each from the identical and the logical 
reversal frames were unambiguous. Thus, for all practical purposes, the 
fifth grade ^s were not able to resolve the ambiguity of the pronominal 
referent; that is, they were not able to use the cues from either the 
word "back" or the verb-pairs to produce a uniform response. 

b. Analysis by the k- limited transducer model . Again this model is 
intended to account for the observed degenerative behavior among the 16 
sentences in each frame. Table 5-25 tabulates the 4-tuples (e,f,g,h) 
for each sentence frame in the He Xed him and the He Xed him back types. 
Since this is our youngest age group, we can assume that the ^s have 
shorter memory spans, on the average, than the other groups. Assuming 
shorter memory spans , we may account for the nearly complete random re- 
sponding in the following way. Since the A-A sentence contains clausses 
whose surface structure is the same as its deep structure, its demand on 
decoding is minimal. Fraser, et al [1963] found that children of age 
three were able to decode active sentences; hence it seems reasonable to 
expect children of age 10.7 (average age of the fifth graders) would be 
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able to decode the A-A sentences. Yet the _Ss were not able to respond 
uniformly to this simplest construction; therefore it seems safe to assume 
that the difficulty was not in the decoding stage, but rather in the 

Table 5-25 

Number of starred cells in each comer submatrix 
for the fifth grade subjects 

Nonsen ce Similar Log. Rev. Identical 

He Xed him 0,0, 0,0 0,0, 0,0 0,0, 1,0 0,1, 0,0 

He Xed him back 0,0, 0,0 0,0, 0,0 0,0, 0,0 1,0, 1,0 

assignment. We can then account for the observed data by the k-limited 
transducer model where the limits of the k’ in the assignment stage were 
exceeded. 



c* Analysis by the information transmission model . The results from 
the analysis by information transmission model appear in Table 5-26. The 
columns are components of transmitted information (the T- terms) and inter- 
action (the A-terms) terms. Again any starred (*) term is significant 
i.e., under the null hypothesis of no transmission between the input sources 
and the output source, the probability of obtaining the term of such a mag- 
nitude is < .01. 

The only doubly— controlled T— terms that are significant come from the 
identical He Xed him back frame. Hence under the optimum condition of 
having cues both from the word ."back" and from "identical verbs", this 
youngest group of Ss was able to produce a significantly uniform response. 
The T(A) term for this frame is also significant, which suggests that 
knowing either the grammatical form of the first clause or knowing both 
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Table 5-27 



Chi-square values between any two sentence types for each 
class of verb-pairs for the fifth grade subjects. 





Xed him 
vs 

Xed him back 


Xed him 
vs 

Each other 


Xed him back 
vs 

Each other 


Nonsense verbs 


1.2 


7.7 


4.4 


Similar verbs 


5 . 6 


4.2 


4.4 


Log. Rev. verbs 


18.5 


12.3 


5.7 


Identical verbs 


4.9 







Table 5-28 



Chi-square values 


between 


any two 


verb-pairs in 


each 




sentence type 


for the 


fifth grade subjects. 






NS-SI 


NS-LR 


NS-ID 


SI-LR 


SI-ID 


LR-ID 


He Xed him 8.2 


8.3 


11.6 


12.9 


8.4 


25.6 


He Xed him back 5.1 


12.1 


12.9 


15.3 


9.8 


23.1 


Each other 11.3 


10.0 




9.8 
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clauses can predict the response better than chance. The generally non- ' 

significant results from the information transmission model imply that ' 

the ^s did not respond according to the structure of the sentence (except 
those in the identical He Xed him back frame). This finding is consistent ' 

with the finding from the Bernoulli trials model that only four sentences 
out of a total of 176 were interpreted as unambiguous. ' 

In Table 5-27 and 5-28, the x^(16) values are tabulated for the | 

differences between any two sentence types for each class of verb-pairs 
and between any two classes of verb-pairs in each sentence type. Since 1 

there were only four starred cells in the Bernoulli trials model and the 
output response was found to be independent of the input sources (except i 

for the identical He Xed him back frame) in the information transmission i 

i 

model, it should not be surprising that all the pair-wise differences are 
small and nonsignificant. In fact even the logical reversal verb-pairs 
were responded to no different from the other verb-pairs. 

4 

d. Summary . In general, the fifth grade were not able to re- 
solve the pronominal ambiguities. For this youngest group, the nearly 
complete random responding was characterized by the k-limited transducer 
model as a case in which the limits of memory span were exceeded. The 
Bernoulli trials model found that nearly all the sentences in every frame 
were random, while the information transmission model found that the 
output responses were independent from the input sources. All models 
support the conclusion that the fifth-graders could not resolve the pro- 
nominal ambiguity. 
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Chapter 6 



DISCUSSION AND SUMMARY 



We have proposed three mathematical models to analyze the observed 
data in the form of contingency tables. The Bernoulli trials model takes 
the responses to a block (in this case, a frame of 16 sentences) of sen- 
tences and applies probability theory to determine the probability level 
of each response. Three assumptions must be satisfied before we can apply 

this model: 

1) Each trial has only two outcomes. 

2) Each trial is independent from any other trial. 

3) The probabilities of the two outcomes remain constant in all trials. 

We are treating each trial as "one S_ responding to one sentence . 

Since the ^s were instructed to mark an on top of either John or Bill 
(and not both) for every sentence, each trial has exactly two outcomes. 

Since each S was working on his own questionnaire booklet, it is reason- 
able to assume that one ^'s responses were independent from any other ^*s 
responses. The third assumption requires that each ^ in a group has the 
same probability in selecting, for example, the word John* impos- 

sible to know whether this assumption is, or can ever be, satisfied, ihe 
procedure used was to apply the best available estimator, the maximum like- 
lihood estimator p, to a large sample. The sample selected was the 16 
different sentences in a frame, giving a total of 16n sample points where 
n is the number of ^s in a group. 
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This estimated p does not distinguish between a group of ^s in 
which all of them truly were neutral in their selection of John or Bill 
as the referent of the pronoun ^ in a sentence and a group where half 
of the ^s strongly felt the selection was Bill and the other half strongly 
felt the selection was John * Assuming that the ^s in both groups were 
consistent in treating the 16 sentences in a frame, then both groups 
would give a p close to .50. Fortunately, we need not distinguish the 
two cases, since we are concerned with whether the sentences are ambig- 
uous or not. In both groups, since the sentences had elicited non- 
uniform responses, the sentences must be considered ambiguous. However, 
the estimated p along with the confidence interval [u,v] can distin- 
guish the above group responses from one in which the subject-slot was 
uniformly chosen in eight of the 16 sentences and the object-slot was 
chosen in the remaining eight sentences (e.g., the "diagonal" response). 
Again the estimated p is close to .50, but since each of the individual 
cell numbers is either high (uniform object-slot) or low (uniform subject- 
slot), they will all fall outside the confidence interval, i.e., either 
!$ u or > V. Since the selected tail-probability a is .01, then the 
probability for the number in a cell being 4 u or v is < .01, i.e., 

it is not likely that the response to that cell was the result of Bernoulli 



For those who are interested in the ^s* response behavior in an all- 
or-none learning situation, the distinction is important. Under such cir- 
cumstances, a more complex experiment could be designed in which each ^ is 
asked to give his response together with a three-point or five-point scale 
for his subjective strength for the response. Without such a subjective 
strength scale, it is impossible to distinguish with a matrix of zeros and 
ones (response of one ^ to one sentence frame) between the result of random 
selection and the result of language usage. Any matrix of zeros and ones 
is equi-probable to any other matrix of zeros and ones. But when a group 
of such matrices are summed, the numbers in each cell follow a binomial 
distribution, if each matrix was the result of random selection from each 
If the cell numbers deviate significantly from a binomial distribution, 
as measured by the confidence interval [u,v], then we have some confidence 
that the responses were based on language usage, and therefore, the sen- 
tences are not ambiguous. 
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trials. Hence we interpret the sentence that produced such a response 
as an unambiguous sentence. 

The k-limited transducer is a precise mathematical model; however, 
when applied to the present data, it serves as conceptual means to explain 
degenerative responding, either when sentences become more complex syntac- 
tically, or when S_s become younger. Since we do not know the psychologi- 
cal unit, whether it be the morpheme, the word, or some higher structured 
linguistic unit, that the ^s used to analyze each sentence and to select 
the pronominal referent, we cannot be very precise about applying the mod- 
el. The basic assumptions of the model are that the internal states of 
the transducer are finite and that each transition depends on reading the 
input tape of at most k units long. These two assumptions are easily sat- 
isfied, since the human brain is definitely finite and there is surely a 
certain maximum length of sentence beyond which _S,s cannot process correctly. 

In their experiment on the recall of sentences with various degrees 
of self-embedding clauses. Miller and Isard [1964] postulated a type of 
push-down memory for explaining their findings. Similarly, McNeill [1965] 
postulated three separate memory spans in explaining the findings of Fraser, 
et al [1963] and Ervin [1964]. For the present data, the writer postu- 
lated two separate memory spans in the k-limited transducer model. The 
model, as diagramed in Figure 4-1, uses two k-limited transducers in se- 
quence; the first one decodes each sentence into its deep structure and the 
second one applies the S.*s knowledge of English usage to assign a pronominal 
referent. Even though the present experiment was conducted in written form, 
memory limitations could still contribute to degenerative responding. The 
written form does not require the S to store each sentence in his immedi- 
ate memory for later processing. However, in order to select the most ap- 
propriate referent for the pronoun, the ^ must analyze each sentence to 
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obtain the subject-object relationship in the two clauses. The results 
of intermediate analysis must be stored so that the pronominal referent 
could be selected in accordance with conventional usage. 

The sequential arrangement of a decoding transducer, followed by an 
assignment transducer, was strongly influenced by the procedure generally 
accepted in the area of language processing by computer [Simmons. 1965; 

Hays, 1963; Lamb, 1965b], of a syntactic-analysis program followed by a 
semantic-analysis program. This arrangement was also suggested by the 
theoretical paper by Katz and Fodor [1963] , in which semantic rules are 
applied only after a complete syntactic analysis to obtain the meaning 
of a sentence. The decoder transducer corresponds to a syntactic analy- 
sis in the sense that only grammatical rules are applied to obtain the 
deep structure of a sentence while the assignment transducer corresponds 
(at least in part) to a semantic analysis in the sense that the knowledge 

of the English language is applied to select the most appropriate pro- 
nominal referent. 

The k and the k’ in the two transducers denote specific length 
of input strings. But sometimes we observe different responses when two 
sentences are of the same length, either by word oi morpheme count, as 

for example in John bejed Bill and he was gowed bv him vs . John punched 

ll lLanJ he was punched by him . Hence we hypothesize a decoding "p^^ogi^ani" 
that is concatenated with the sentence as an input string to the decoding 
transducer and an assignment "program" that is concatenated with the de- 
coded string (from the decoding transducer) as an Input string to the assign- 
ment transducer. The "program" reflects a level of difficulty that is 
caused by the level of syntactic complexity in a sentence and the level 
of association strength between the two verbs of a particular verb-pair. 

As mentioned in Chapter 4, there is some behavioral evidence from the re- 
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suits of Mehler [1963J, Mehler and Miller [1964], and Savin and Perchonock 
[1965] for including a decoding "program", and from Epstein [1961] and 
Rosenberg [1966] for including an assignment "program". 

The weakest part of this model is perhaps the serial ordering of the 
two transducers. The writer does not know of any behavioral evidence in 
support of the hypothetical ordering, that of syntactic analysis followed 
by semantic analysis, nor, indeed, does he believe that it ever could be 
shown to be true. A recent experiment by Hull and Gough [1966] attempted 
to test the hypothesis that a listener analyzes a sentence syntactically 
before he analyzes it semantically. But due to interactions between syn- 
tactic and semantic analyses, the authors concluded that their data did 
not support the hypothesis. While the ordering of the two analyses fa- 
cilitate both computer processing and theory construction, the human being 
is not necessarily constrained in this manner. 

The development of the information transmission model as presented 
in Chapter 4 is directly applicable to the data in contingency tables. The 
advantage of this model is that each of the design variables is treated as 
an Independent input source from which the relation between the individual 
input sources or any of these in combination, and the output source can be 
computed, buch a relation, measured by the T-terms (transmitted informa- 
tion terms), gives us the degree of predictability of the output response, 
l.e., the uniformity of responding in a group. 

We did not use all the design variables as input sources for the infor- 
mation transmission model, because the model computes an average value over 
the differtent values of a variable. For example, if the sentence types 
were included as in input source, then (along with the other variables) the 
transmitted Information terms could be computed. However, since the re- 
sponses to the Each other type were generally random and the responses to 
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the He Xed him and to the He Xed him back types were generally ’’diagonal”, 
then the average ot these three sentence types would not correctly repre- 
sent any sentence. 

The results of the present experiment demonstrated that _Ss could re- 
duce the number of interpretations of a key word in a sentence. In this 
we agree with the results of Kaplan’s experiment [1949]. But unlike Kaplan’s 
results, the present experiment showed that the resolution of the pronominal 
referent was not contingent on neighboring words. The parameters that could 
explain the different response matrices for the pronominal referent were 
the subject-object relationship and the two possible interpretations: ’’the 

sam«:i person performed both actions” and ’’each person performed one action”. 
The design variables of four grammatical forms, three sentence types and 
four classes of verb-pairs caused differences in response matrices due to a 
change in either the subject-object relationship or one of the two possible 
interpretations. The changes caused by the grammatical forms were obvious, 
since a passive construction alternated the subject-object relationship 
from the corresponding active construction, while a negative construction 
generally did not, since the subject— object relationship was invariant un- 
der the negative transformation. The sentence types could make one of two 
interpretations more probable, or neither one as in the Each other type. 

The changes caused by the different classes of verb-pairs might be attrib- 
uted to a class of semantic features to indicate a direction of action for 
all transitive verbs. At least three of these should be considered. There 
is an outward feature, exemplified by ’’hit”, ’’strike”, and a majority of 
transitive verbs, where the action is from the subject to the object; an 
3 .nward feature, exemplified by the first word of the logical reversal verb- 
pairs, ’’understand”, ’’remember”, etc., where the subject receives (or sen- 
ses) an action; and a reciprocal feature, exemplified by ’’meet”, ’’marry”, 
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etc., where the action between the subject and the object is reciprocated, 
i.e., A Xed B implies B Xed A . Thus for a given Interpretation, the re- 
sponse in which the first verb of the verb-pair haa the "inward" feature 
is expected to be opposite from one in which the first verb of the verb- 
pair has the "outward" feature, and in addition, the response in which the 
first verb of the verb-pair has the "reciprocal" feature is expected to 
be random. As a consequence, the dictionary for computer processing, using 
the English language, should contain such a class of semantic feature for 
the transitive verbs. 

There are parameters whicn laused degenerative (i.e., tendency toward 
random responding) behavior, any sentence from the Each other type evoked 
a random response. A syntactically complex sentence (involving the passive 
transformation) evoked less uniform responding than more simply constructed 
sentences. Changing from identical verb-pairs to the other classes of verb- 
pairs caused degeneration in ^s* responding, and a similar degeneration was 
observed when real verb-pairs were changed to nonsense verb-pairs. In gen- 
eral, deletion of the word "back" also caused some degeneration, especially 
with nonsense verb-pairs. Finally, the younger the £s, the more degenera- 
tive their responses; at the extreme, the youngest group, wigh an average 
age of 10.7 years, produced random responding to nearly all the sentences. 
All these degenerative responses can be explained with the k-limited trans- 
ducer model. If a sentence exceeds either the k units available for de- 
coding, or the k' units available for assigning the pronominal referent, 
or both of these, then that sentence evokes a random response. 

The results revealed that the fifth grade £s could not resolve pro- 
nominal ambiguity. Further experimentation with this age group may isolate 
the point of difficulty. Then by varying the verb-pair, simplifying the 
construction of the sentence anu/or providing some situational cues, such 
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as a short story of which the sentence is a part, it might be possible to 
detect the syntactic and semantic features needed in the process of re- 
solving pronominal ambiguity. Presumably these features are learned as 
S^s are more exposed to everyday language usage. Likewise these features, 
in addition to others, must necessarily be included in a computer program, 
if it is to simulate human language processing. 

Two developmental changes occurred sometime between the fifth and 
eighth grades: one in acquiring the semantic feature for "inward" and 

"outward" verbs, and the other in acquiring the constructional ambiguity 
when sentences using the "inward" verbs are expressed in the negative- 
affirmative constructions (i.e., sentences corresponding to the "deviant" 
cells in the logical reversal frames) . Since the responses from the fifth 
grade S_s were all random, it is safe to assume that these youngest £s 
could not distinguish the semantic feature for the "inward" and the "out- 
ward" verbs. On the other hand, the seventh grade S,s could make such a 
distinction, but the results were not conclusive on whether these ^s also 
acquired the constructional ambiguity. Thus with individual testing In 
this age range, where the experimenter could give additional cues, it might 
be possible to isolate the features that the S^s must necessarily learn 
before the distinctions were made. Again, for computer applications, such 
features must somehow be encoded in the program before it could claim to 
have any "intelligence". 

Finally, the results revealed that the sentences with nonsense verb- 
pairs ware responded to as if the verbs had the "outward" semantic feature. 
This is perhaps not surprising in view of the fact that most transitive 
verbs have this feature. However, it is an interesting empirical question 
whether this would still be true of the nonsense verbs if sentences con- 
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taining them were presented immediately following a set of sentences whose 
first verbs all have the "inward" feature, viz., the set of logical rever- 
sal verbs, or following a set of sentences whose first verbs all have the 
"reciprocal" feature. For sentences with identical verb-pairs, responses 
did not depend on whether or not the sentence included the word "back" , 
but for sentences with nonsense verb-pairs, the responses degenerated con- 
siderably when the word "back" was absent. If we were to test sentences 
when the two nonsensical verbs were identical, the present finding would 
predict no degeneration when the word "back" is absent. 
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Test Sentences 
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February 8, 1966 



Name: 

Age: years months 

Major area of study: 



INSTRUCTION 

In the following pages, you will find sentences which use the pronoun 
he to refer to either of two persons, John or Bill c Your task is to de- 
cide, for each sentence, the most appropriate referent or antecedent of the 
pronoun hce. Please mark an "x” on top of John or Bill as the referent of 
he . 

Please answer every sentence in the given order. Please write down 
the time in the given spaces. 



[Instruction sheet for the college group] 
[This is the first sheet of the? testing booklet] 



o 
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The following 2 pages contain sentences that have nonsense words in 
them. For example, 

John bejed Bill and he also gowed him. 

Your task is to decide whether the pronoun he refers to John or to Bill . 
Please do not substitute any real words for the nonsense words, but base 
your judgment upon the structures of the 2 clauses. Please then mark 
an "x" on top of that word. 



Start time: 
Finish time: 
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[This sheet precedes the part with nonsense verb-pairs] 
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The following 5 pages contain sentences that you can understand easily. 
Please read each sentence and decide whether the pronoun ha refers to John 
or to Bill . Then mark an "x" on top of that word. 



Start time: 
Finish time: 



[Instruction sheet for the college group] 



[This sheet precedes the part with real verb-pairs] 



m 



mum 





















139 

Name: 

Birthday: 



INSTRUCTIONS 

In the following pages, you will find sentences using the word he 
in them. For example: 

"John saw Bill and he called him." 

Some of you may think the word he means John and some others may think 
the word he means Bill . Your job is to decide, for each sentence, which 
person (John or Bill ) , goes with the word he » Please mark an "x" on top 
John or Bill , depending on which one you think it is. 

Please answer every sentence in the given order. 



[Instruction sheet for the children groups] 



[This is the first sheet of the testing booklet] 
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The following 5 pages contain sentences that you can understand. 
Please read each sentence and then decide on whether the word ^ means 
John or Bill. Then mark an "x" on top of that word* 





Start time: 
Finish time: 







[Instruction sheet for the children groups] 
[This sheet precedes the part with real verb-pairs] 
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The sentences on the following 2 pages have made-up words in them. 
For example, 

"John bejed Bill and he also gowed him." 

I don’t want you to guess some real words for the made-up words. Instead 
decide which name, John or Bill , goes with the word he by understanding 
the sentence with the made-up words in it. Then mark an x on top of 
that name. 



Start time: 
Finish time: 



[Instruction sheet for the children groups] 



[This sheet precedes the part with nonsense verb-pairs] 
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Bill did not tef John but he was cujed back by him. 

Bill and John did not qog each other and he was not tived by him. 

John waved Bill and he was keced back by him. 

Bill did not zab John but he jebed him. 

John kuwed Bill and he yimed him back. 

Bill and John were zoced by each other but he did not peh him. 

Bill and John did not kez each other but he was qemed by him. 

Bill yejed John but he did not koj him back. 

John and Bill were vuced by each other and he was yuged by him. 

Bill and John were not kuwed by each other and he did not luj him. 

Bill and John daqed each other but he was not tijed by him. 

Bill was zoced by John but he did not kej him back. 

John nijed Bill but he was not caqed back by himo 
Bill and John kajed each other but he did not peh him. 

Bill zoved John but he did not faj him. 

Bill was not vebed by John but he fubed him back. 

John cojed Bill but he was not wuhed by him. 

John kexed Bill and he was yaged by him. 

Bill did not buv John but he was ruqed by him. 

John was vided by Bill and he was gojed by him. 

John and Bill did not foh each other and he did not kuy him. 

Bill was not yuged by John and he did not mib him back. 

Bill was uob qiwed by John but he was pehed back by him. 

Bill was yuked by John but he was not miped back by him. 
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John was kaqed by Bill and he wuhed him. 

Bill did not guw John and he was not quved by him. 

John did not foh Bill but he vajed him back. 

Bill did not jex John and he did not weq him. 

John was qaped by Bill and he was tejed back by him. 

Bill and John caqed each other and he was jived by him. 

Bill was not zeced by John and he did not nij him. 

Bill was not guked by John and he was not vused by him. 

John and Bill were not fubed by each other but he was qosed by him. 

John and Bill were not jebed by each other but he sebed him. 

John and Bill were not vided by each other and he was not qoned by him. 

John and Bill did not gax each other but he lijed him. 

John was zeved by Bill and he pehed him back. 

Bill qeged John and he yeced him. 

John and Bill qaped each other and he tijed him. 

John was not zused by Bill but he was qiged by him. 

John was not yozed by Bill but he lijed him. 

John was not guhed by Bill and he was not vused back by him. 

Bill and John were zuted by each other but he was not kiyed by him. 

John and Bill were mibed by each other and he ruqed him. 

John did not quk Bill and he was not bejed back by him. 

Bill did not wib John and he did not jic him back. 

Bill was yoged by John but he was not kuyed by him. 

Bill was quped by John but he did not bij him. 
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John was teased by Bill and he was teased by him. 

Bill and John did not recognize each other and he did not Invite him. 
Bill understood John and he answered him back. 

John did not frighten Bill and he did not scare him back. 

Bill did not push John but he shoved him back. 

Bill was remembered by John but he was not phoned by him. 

Bill did not understand John and he was not answered back by him. 

Bill did not punch John and he did not punch him back. 

John pushed Bill but he was not shoved back by him. 

John did not recognize Bill and he did not Invite him. 

John was frightened by Bill and he scared him. 

John was not helped by Bill but he assisted him. 

Bill was not pushed by John but he was shoved by him. 

Bill and John heard each other but he was not called by him. 

Bill and John were not heard by each other but he vjas called by him. 
John and Bill did not hit each other and he was not kicked by him. 
John did not help Bill but he was assisted by him. 

John did not hear Bill but he called him. 

John was not remembered by Bill but he phoned him back. 

John did not hurt Bill but he hurt him. 

Bill did not help John but he was assisted back by him. 

Bill was understood by John and he was answered by him. 

Bill was heard by John but he did not call him back. 

Bill hit John and he kicked him. 

John was understood by Bill and he was answered back by him. 

John and Bill were frightened by each other and he scared him. 
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Bill did not tease John and he was not teased by him. 

John and Bill understood each other and he answered him. 

Bill was not struck by John but he struck him. 

Bill remembered John but he did not phone him back. 

John hurt Bill but he was not hurt by him. 

Bill pushed John but he was not shoved by him. 

Bill struck John but he did not strike him back. 

Bill was not remembered by John but he phoned him. 

John was heard by Bill but he did not call him. 

John was hurt by Bill but he did not hurt him. 

John was not hit by Bill and he did not kick him. 

Bill was not recognized by John and he was not invited back by him. 
Bill and John were not pushed by each other but he was shoved by him. 
John was recognized by Bill and he Invited him. 

John was punched by Bill and he punched him. 

Bill was not frightened by John and he was not scared back by him. 
Bill heard John but he was not called by him. 

John did not frighten Bill and he did not scare him. 

John did not hear Bill but he called him back. 

Bill and John did not help each other but he was assisted by him. 

John and Bill did not remember each other but he was phoned by him. 
Bill frightened John and he was scared back by him. 

Bill was struck by John but he was not struck by him. 

John and Bill were heard by each other but he did not call him. 

John was not hurt by Bill but he was hurt by him. 
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John was frightened by Bill and he scared him back. 

Bill was helped by John but he was not assisted by him. 

Bill was not punched by John and he was not punched by him. 

John was teased by Bill and he was teased back by him. 

John and Bill were not helped by each other but he assisted him. 

Bill and John were not remembered by each other but he phoned him. 

Bill did not punch John and he did not punch him. 

John did not recognize Bill and he did not invite him back. 

Bill and John were helped by each other but he was not assisted by him. 

John and Bill were hit by each other and he was kicked by him. 

John and Bill did not hear each other but he called him. 

Bill did not strike John but he was struck by him. 

Bill was not frightened by John and he was not scared by him. 

John heard Bill but he was not called back by him. 

John was helped by Bill but he was not assisted back by him. 

Bill and John pushed each other but he was not shoved by him. 

John and Bill were pushed by each other but he did not shove him. 

John hit Bill and he kicked him back. 

John recognized Bill and he was invited by him. 

John was hit by Bill and he was kicked by him. 

Bill understood John and he answered him. 

Bill was hurt by John but he did not hurt him back. 

Bill was not heard by John but he was called by him. 

John frightened Bill and he was scared by him. 

Bill hurt John but he was not hurt back by him. 

Bill was pushed by John but he did not shove him back. 
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John was remembered by Bill but he was not phoned back by him. 

John and Bill hit each other and he kicked him. 

Bill was not helped by John but he assisted him back. 

Bill did not tease John and he was not teased back by him. 

Bill did not hit John and he was not kicked by him. 

Bill remembered John but he did not phone him. 

Bill and John frightened each other and he was scared by him. 

John and Bill recognized each other and he was invited by him. 

John did not remember Bill but he was phoned by him. 

John did not hurt Bill but he hurt him back. 

John did not remember Bill but he was phoned back by him. 

Bill teased John and he teased him back. 

Bill was punched by John and he punched him back. 

John was pushed by Bill but he did not shove him. 

Bill was not punched by John and he was not punched back by him^ 

John and Bill were not frightened by each other and he was not scared by him. 

John and Bill were not recognized by each other and he was not invited by him. 

Bill did not push John but he shoved him. 

John was not understood by Bill and he did not answer him back. 

John helped Bill but he did not assist him. 

John punched Bill and he was punched back by him. 

Bill was struck by John but he was not struck back by him. 

Bill was not recognized by John and he was not invited by him. 

Bill did not understand John and he was not answered by him. 

John and Bill were understood by each other and he was answered by him. 

John was not teased by Bill and he did not tease him back. 
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Bill did not strike John but he was struck back by him. 

Bill was not heard by John but he was called back by him. 

John was not hurt by Bill but he was hurt back by him. 

Bill and John did not frighten each other and he did not scare him. 

Bill recognized John and he was invited back by him. 

Bill helped John but he did not assifst him back. 

Bill and John did not understand each other and he was not answered by him. 
John and Bill helped each other but he did not assist him. 

John and Bill were recognized by each other and he invited him. 

John and Bill remembered each other but he did not phone him. 

Bill teased John and he teased him. 

John and Bill were not understood by each other and he did not answer him. 

Bill struck John but he did not strike himo 

John was hit by Bill and he was kicked back by him. 

John was not struck by Bill but he struck him back* 

Bill was not hit by John and he did not kick him back. 

John was not understood by Bill and he did not answer him. 

John was not teased by Bill and he did not tease him. 

John and Bill were remembered by each other but he was not phoned by him. 
John and Bill did not push each other but he shoved him. 

Bill punched John and he was punched by him. 

Bill was recognized by John and he invited him back. 

Bill was not pushed by John but he was shoved back by him. 

Bill and John were not hit by each other and he did not kick him. 

John did not hit Bill and he was not kicked back by him. 
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Response Matrices and Analysis by Information Transmission 

Model for the Ninth Grade ^s. 
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Only li sale Sa for the nonsense verbs 
Starred cell (*) has p < .01 



Components of transmitted information for 14 male and 11 female ninth grade subjects. 
The average I.Q. scores for the male and female £s are 98 and 103 respectively. 
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APPENDIX C 



Chart Showing the Transmitted Information 
in an Ideal Group of n ^s. 
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